Abstract: In this paper we study the sum rate of zero-forcing
(ZF) as well as regularized ZF (RZF) precoding in large MISO 
broadcast channels, under the assumptions of imperfect channel 
state information at the transmitter, channel transmit correlation 
and different user path losses. Our analysis assumes that the 
number of transmit antennas M and the number of users K 
are large and of the same order of magnitude. We apply recent 
results on the empirical spectral distribution of certain kinds 
of large dimensional random matrices to derive deterministic 
equivalents for the signal-to-interference plus noise ratio (SINR) 
at the receivers. Based on these results and under sum rate 
maximization, we evaluate for RZF (i) the optimal precoder, 
for ZF (ii) the optimal number of active users and (iii) the 
optimal amount of channel training in TDD multi-user systems. 
Moreover, we study the achievable sum rate under limited 
feedback and derive an approximation of the feedback rate 
required to maintain a given rate offset relative to perfect CSIT. 
Numerical simulations suggest that the approximations, almost
surely exact as $M, K \to \infty$, are accurate even for small $M$, $K$.

Index Terms — Broadcast channel, random matrix theory, lin- 
ear precoding, limited feedback, multi-user. 

I. Introduction 

THE pioneering work in [1] and [2] revealed that the 
capacity of a point-to-point (single-user (SU)) multiple- 
input multiple-output (MIMO) channel can potentially in- 
crease linearly with the number of antennas. However, practi- 
cal implementations quickly demonstrated that in most prop- 
agation environments the promised capacity gain of SU- 
MIMO is unachievable due to antenna correlation and line- 
of-sight components [3]. In a multi-user scenario, the inherent 
problems of SU-MIMO transmission can largely be overcome 
by exploiting multi-user (MU) diversity, i.e. sharing the spatial 
dimension not only between the antennas of a single receiver, 
but among multiple (non-cooperative) users. The underlying 
channel for MU-MIMO transmission is referred to as the 
MIMO broadcast channel (BC) or MU downlink channel. Although
much more robust to channel correlation, the MIMO-BC
suffers from inter-user interference at the receivers which
can only be efficiently mitigated by appropriate (i.e.
channel-aware) pre-processing at the transmitter.

It has been proved that dirty-paper coding (DPC) is a 
capacity achieving precoding strategy for the Gaussian MIMO- 
BC [4]-[8]. But the DPC precoder is non-linear and to this
day too complex to be implemented efficiently in practical
systems. However, it has been shown in [4], [9]-[11], that
suboptimal linear precoders can achieve a large portion of the 
BC rate region while featuring low computational complexity. 
Thus, a lot of research has recently focused on linear precoding 
strategies. 

In general, the rate maximizing linear precoder has no 
explicit form. Several iterative algorithms have been proposed 
in [12], [13], but no global convergence has been proved. 
Still, these iterative algorithms have a high computational com- 
plexity which motivates the use of further suboptimal linear 
transmit filters (i.e. precoders), by imposing more structure 
into the filter design. A straightforward technique is to precode 
by the inverse of the channel matrix. This scheme is usually 
referred to as channel inversion or zero-forcing (ZF) [4]. 
Similar to the approach pursued in the present contribution, 
the authors in [14], [15] carry out a large system analysis 
assuming that the number of transmit antennas M as well as 
the number of users $K$ grow large while their ratio $\beta = M/K$
remains bounded. It is shown in [14] that for $\lim_M \beta > 1$,
ZF achieves a large fraction of the linear (w.r.t. $K$) sum rate
growth. The work in [9] extends the analysis in [14] to the case
$\beta = 1$ and shows that the sum rate of ZF is constant in $K$ as
$K \to \infty$; the linear sum rate growth is lost due to the large ratio
of the maximum to minimum singular value of the channel 
matrix. The authors in [9] counter this problem by introducing 
a regularization term into the inverse of the channel matrix. 
Under the assumption of large K and for any rotationally- 
invariant channel distribution, [9] derives the regularization 
term that maximizes the signal-to-interference plus noise ratio 
(SINR). In this article, the resulting regularized ZF (RZF) 
precoders are referred to as channel distortion-unaware RZF 
(RZF-CDU), since their design assumes perfect channel state 
information at the transmitter (CSIT). It has been observed 
that the RZF-CDU is very similar to the transmit filter derived 
under the minimum mean square error (MMSE) criterion [16] 
and both become identical in the large K limit. Likewise, we 
will observe some similarities between RZF and MMSE filters 
when considering imperfect channel state information at the 
transmitter (CSIT). 

In the large system limit, i.e. $M \to \infty$, and for independent
and identically distributed (i.i.d.) channels the cross
correlations between the user channels, and therefore the user 
SINRs, are identical. It has been shown in [17] that for 
this symmetric case and equal noise variances, the SINR 
maximizing precoder is of closed form and coincides with 
the RZF precoder. This asymptotic optimality motivates a detailed
analysis of the RZF precoder for large system dimensions.
The RZF-CDU has been analyzed for i.i.d. channels in a
large system in [18] for a single-cell setup and in [19] for
a two-cell system. In [18] the authors derive the optimal 
regularization parameter using an asymptotic expression for 
the SINR. The expressions in [18] are a special case of 
the results derived in this paper. However, the approach and 
the tools for the derivations of the asymptotic SINR in [18] 
have been independently introduced earlier in the context 
of a different RZF precoder [20], where the regularization
parameter is set to fulfill the total transmit power constraint. 

Although it is legitimate that [9], [12], [13], [16] assume 
perfect CSIT to determine theoretically optimal performance, 
this assumption is untenable in practice. Also, it is a particu- 
larly strong assumption, since the performance of all precoding 
strategies depends crucially on the CSIT. In practical
systems, the transmitter has to acquire the channel state infor- 
mation (CSI) of the downlink channel by feedback signaling 
from the uplink. Since in practice the channel coherence time 
is finite, the information of the instantaneous channel state is 
inherently incomplete. For this reason, a lot of research has 
been carried out to understand the impact of imperfect CSIT 
on the system behavior, cf. [21] for a recent survey. 

An information theoretic analysis of the impact of imperfect 
CSIT on the achievable rate of a ZF precoded MU-MISO 
downlink channel with $\beta = 1$ has been carried out in [22].
Hereby, the author derives an upper bound on the per-user 
rate gap between perfect CSIT and imperfect CSIT under 
random vector quantization (RVQ) with B feedback bits per 
user. Under finite-rate feedback, both [22] and [23] observe 
a sum-rate ceiling for high signal-to-noise ratios (SNR). Moreover,
[22, Theorem 3] provides a formula for the minimum scaling
of $B$ to maintain a per-user rate gap of $\log_2 b$ bits/s/Hz.
Although derived for ZF, the author claims that for all SNR, 
[22, Theorem 3] is even more accurate for the RZF-CDU 
proposed in [9]. 

This work extends the results in [9], [14], [18], [22], [23] by 
applying novel results of large dimensional random matrices 
to derive deterministic approximations of the SINR under ZF 
and RZF precoding. These approximations are referred to 
as deterministic equivalents as they are independent of the 
stochastic parameters of the system model. Moreover, these 
deterministic equivalents match the true SINR, almost surely, 
for asymptotically large K. Also, as corroborated by numerical 
results in Section VII-B, they approximate the true SINR very 
accurately even for small K. The communication channel 
correlation is modeled as Kronecker to account for transmit 
correlation and different user path losses. In this general 
case, these deterministic equivalents do not have a closed 
form expression but are the solution of an implicit equation. 
For uncorrelated channels and equal path losses though, the 
deterministic equivalents have a closed form expression. 

This framework directly extends the analysis for ZF in 
[14] and [18] to include transmit correlation, path losses 
and imperfect CSIT. Furthermore, our approach allows for a 
unification and extension of the RZF analysis in [9], [18], 
[22]. In particular, we optimize the RZF-CDU proposed in
[9], [18], where the optimal regularization term is the solution
to an implicit equation and has a closed form for uncorrelated
signals and equal path losses. This optimal RZF precoder is 
referred to as ORZF. Moreover, for RZF and ZF, to maintain a 
per-user rate offset of log2 b bits/s/Hz, we derive the required 
scaling laws of the distortion of the CSIT as a function of the 
SNR. Under RVQ, these results extend [22, Theorem 3]. 
Our main contributions can be summarized as follows: 

• We derive deterministic equivalents for the SINR of ZF
($M > K$) and RZF ($M \ge K$) precoders, i.e. deterministic
approximations of the SINR, which are independent of
the individual channel realizations, and asymptotically
(almost surely) exact as $M, K \to \infty$. Numerical results
indicate that these approximations are accurate even for
finite $(K, M)$.

• The sum rate of the ORZF saturates under limited feed- 
back at asymptotically high SNR, and we determine the 
sum rate saturation value. 

• Under RVQ, for $\beta = 1$ and high SNR $\rho$, to maintain
a per-user rate offset of $\log_2 b$ bits/s/Hz, the number of
feedback bits $B$ per user has to scale approximately with

- ORZF: $B = (M-1)\log_2\rho - (M-1)\log_2(b^2-1)$

- RZF-CDU: $B = (M-1)\log_2\rho - (M-1)\log_2\big(2(b-1)\big)$

That is, the ORZF requires $(M-1)\log_2\frac{b+1}{2}$ bits less than
the former RZF-CDU and $(M-1)\log_2(b+1)$ bits less
than ZF.
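
The scaling laws in the last bullet can be tabulated with a few lines of Python. This is only an illustrative sketch: the function name `feedback_bits` and the scheme labels are hypothetical, and the ZF expression $B = (M-1)\log_2\rho - (M-1)\log_2(b-1)$ is not stated explicitly above but is inferred from the remark that ORZF saves $(M-1)\log_2(b+1)$ bits relative to ZF.

```python
import math

def feedback_bits(M, rho, b, scheme):
    """Approximate feedback bits per user for a rate offset of log2(b) bits/s/Hz.

    M: number of transmit antennas, rho: SNR (linear scale), b > 1.
    scheme: "orzf", "rzf_cdu" or "zf" (labels chosen for this sketch).
    """
    # Argument of the second logarithm for each scheme; the "zf" entry is
    # inferred from the stated savings, not quoted from the text.
    offset = {"orzf": b**2 - 1, "rzf_cdu": 2 * (b - 1), "zf": b - 1}[scheme]
    return (M - 1) * math.log2(rho) - (M - 1) * math.log2(offset)

if __name__ == "__main__":
    for name in ("orzf", "rzf_cdu", "zf"):
        print(name, feedback_bits(4, 1024.0, 3.0, name))
```

For $M = 4$, $\rho = 1024$ and $b = 3$ the differences between the three values reproduce the savings $(M-1)\log_2\frac{b+1}{2} = 3$ and $(M-1)\log_2(b+1) = 6$ bits quoted above.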

The remainder of the paper is organized as follows. Section 
II introduces tools from random matrix theory essential to our 
subsequent derivations. Section III presents the communica- 
tion channel model. In Section IV, we derive deterministic 
equivalents for the sum rate of RZF and ZF. In Section V, 
we analyze the sum rate under limited feedback. Section 
VI studies several practical applications and discusses the
implications of our derivations. Section VII presents numerical
results comparing the theoretical findings to Monte-Carlo 
simulations. Finally, in Section VIII, we summarize our results 
and conclude the paper. Note that Appendix IV collects a set
of lemmas and a corollary that will be used throughout the 
paper. 

Notation: In the following boldface lower-case and upper-case
characters denote vectors and matrices, respectively. The
operators $(\cdot)^H$, $\mathrm{tr}(\cdot)$ and, for $\mathbf{X}$ of size $N \times N$, $\mathrm{Tr}\,\mathbf{X} = \frac{1}{N}\mathrm{tr}\,\mathbf{X}$
and $\|\mathbf{X}\|$ denote conjugate transpose, trace, normalized matrix
trace and spectral norm, respectively. The expectation is $E[\cdot]$
and $\mathrm{diag}(x_1, \dots, x_N)$ is the diagonal matrix with $i$th diagonal
entry $x_i$. The $N \times N$ identity matrix is $\mathbf{I}_N$ and $\Im[z]$ is the
imaginary part of $z \in \mathbb{C}$.

II. Mathematical Preliminaries 

In the present work we are interested in deterministic
equivalents of functionals of matrices of the form

$$m_{\mathbf{B}_N,\mathbf{Q}_N}(z) = \mathrm{Tr}\,\mathbf{Q}_N\left(\mathbf{B}_N - z\mathbf{I}_N\right)^{-1}, \tag{1}$$

where $\mathbf{Q}_N \in \mathbb{C}^{N\times N}$ is a Hermitian positive definite matrix
and $\mathbf{B}_N \in \mathbb{C}^{N\times N}$ is of the type

$$\mathbf{B}_N = \mathbf{R}_N^{1/2}\mathbf{X}_N^H\mathbf{T}_N\mathbf{X}_N\mathbf{R}_N^{1/2} + \mathbf{S}_N, \tag{2}$$

where $\mathbf{R}_N, \mathbf{S}_N \in \mathbb{C}^{N\times N}$ are nonnegative definite Hermitian
matrices, $\mathbf{R}_N^{1/2}$ is a Hermitian square-root of $\mathbf{R}_N$, $\mathbf{T}_N \in \mathbb{C}^{n\times n}$
is a nonnegative definite diagonal matrix and $\mathbf{X}_N \in \mathbb{C}^{n\times N}$ is
random with i.i.d. entries of zero mean and variance $1/N$. In
the course of the derivations, we will require the following
result.

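Theorem 1: Let $\mathbf{B}_N$ be defined as in (2), where we assume
that $\mathbf{T}_N$, $\mathbf{R}_N$ and $\mathbf{Q}_N$ have uniformly bounded spectral norm
(with respect to $N$), as $(n, N)$ grow large with ratio $\beta(N) = N/n$
such that $0 < \liminf_N \beta(N) \le \limsup_N \beta(N) < \infty$.
Define $m_{\mathbf{B}_N,\mathbf{Q}_N}(z)$ as in (1). Then, for $z \in \mathbb{C}\setminus\mathbb{R}^+$,

$$m_{\mathbf{B}_N,\mathbf{Q}_N}(z) - m^\circ_{\mathbf{B}_N,\mathbf{Q}_N}(z) \xrightarrow{N\to\infty} 0, \tag{3}$$

almost surely, with $m^\circ_{\mathbf{B}_N,\mathbf{Q}_N}(z)$ given by

$$m^\circ_{\mathbf{B}_N,\mathbf{Q}_N}(z) = \mathrm{Tr}\,\mathbf{Q}_N\left(c(z)\mathbf{R}_N + \mathbf{S}_N - z\mathbf{I}_N\right)^{-1} \tag{4}$$

and $e(z)$ is the unique solution of

$$e(z) = \mathrm{Tr}\,\mathbf{R}_N\left(c(z)\mathbf{R}_N + \mathbf{S}_N - z\mathbf{I}_N\right)^{-1}, \tag{5}$$

$$c(z) = \frac{1}{\beta}\mathrm{Tr}\,\mathbf{T}_N\left(\mathbf{I}_n + e(z)\mathbf{T}_N\right)^{-1} \tag{6}$$

with positive imaginary part if $\Im[z] > 0$, with negative
imaginary part if $\Im[z] < 0$, or real positive if $z < 0$. Moreover
$e(z)$ is analytic on $\mathbb{C}\setminus\mathbb{R}^+$ and of uniformly bounded modulus
on every compact subset of $\mathbb{C}\setminus\mathbb{R}^+$.
The proof of Theorem 1 is provided in Appendix I.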

We denote $m_{\mathbf{B}_N}(z) = m_{\mathbf{B}_N,\mathbf{I}_N}(z)$ the Stieltjes transform
[24] of the eigenvalue distribution of $\mathbf{B}_N$, used in random
matrix theory, e.g. by Marčenko and Pastur [25] to derive
the limiting distribution of sample covariance matrices (which
corresponds here to the case $\mathbf{Q}_N = \mathbf{R}_N = \mathbf{S}_N = \mathbf{I}_N$).

Remark 1: If $\mathbf{X}_N$ is Gaussian, besides (3), the authors infer
that, by applying the Gaussian method [26], it can be shown 
that 

almost surely. This result is outside the scope of this paper 
and will not be proved. 

For practical purposes, we prove hereafter that the implicit
equation (5) can be solved numerically when $z < 0$; the parameter
$z$ will in particular be linked to the regularization factor
for RZF precoders. The fixed-point algorithm, when properly
initialized, is shown to converge
surely to the unique positive solution of (5).

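Proposition 1: Let $z < 0$. Define the sequence $e_0, e_1, \dots$ as
$0 < e_0 < -1/z$ and, for $k \ge 0$,

$$e_{k+1} = \mathrm{Tr}\,\mathbf{R}_N\left(c_k\mathbf{R}_N + \mathbf{S}_N - z\mathbf{I}_N\right)^{-1}, \tag{8}$$

$$c_k = \frac{1}{\beta}\mathrm{Tr}\,\mathbf{T}_N\left(\mathbf{I}_n + e_k\mathbf{T}_N\right)^{-1}. \tag{9}$$

Then the sequence $e_0, e_1, \dots$ converges surely to $e(z)$, the
unique solution to (5) in Theorem 1.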

The proof of Proposition 1 is provided in Appendix II.
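
The iteration of Proposition 1 is straightforward to run numerically. The following NumPy sketch is an illustration rather than the authors' implementation: the function name `solve_e`, the stopping rule and the starting point $e_0 = -1/(2z)$ are assumptions, and the factor $1/\beta$ with $\beta = N/n$ is written out as in (9).

```python
import numpy as np

def solve_e(R, S, t, z, tol=1e-12, max_iter=100_000):
    """Fixed-point iteration (8)-(9) for z < 0; returns (e, c).

    R, S: N x N Hermitian nonnegative definite arrays.
    t:    length-n array holding the diagonal of T_N.
    """
    t = np.asarray(t, dtype=float)
    N, n = R.shape[0], len(t)
    beta = N / n

    def c_of(e):
        # (1/beta) * Tr T (I + e T)^{-1}, with Tr the normalized trace over n
        return np.mean(t / (1.0 + e * t)) / beta

    def e_of(c):
        # Tr R (c R + S - z I)^{-1}, with Tr the normalized trace over N
        return np.trace(R @ np.linalg.inv(c * R + S - z * np.eye(N))).real / N

    e = -0.5 / z  # assumed starting point; positive because z < 0
    for _ in range(max_iter):
        e_new = e_of(c_of(e))
        converged = abs(e_new - e) < tol
        e = e_new
        if converged:
            break
    return e, c_of(e)
```

As a sanity check, for $\mathbf{R}_N = \mathbf{T}_N = \mathbf{I}$, $\mathbf{S}_N = 0$, $\beta = 1$ and $z = -1$ the equations reduce to $e^2 + e - 1 = 0$, so the iteration should return $e = (\sqrt{5}-1)/2$.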

III. System Model 

This section describes the transmission model as well as the 
underlying channel model. 



A. Transmission Model 

Consider the MISO broadcast channel composed of a central 
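transmitter equipped with $M$ antennas and of $K$ single-antenna
receivers. Assume narrow-band communication. Denoting
$y_k$ the signal received by user $k$, the concatenated
received signal vector $\mathbf{y} = [y_1, \dots, y_K]^T \in \mathbb{C}^K$ at a given time
instant reads

$$\mathbf{y} = \sqrt{M}\,\mathbf{H}\mathbf{G}\mathbf{s} + \mathbf{n} \tag{10}$$

with symbol vector $\mathbf{s} = [s_1, \dots, s_K]^T \sim \mathcal{CN}(0, \mathbf{I}_K)$, precoding
matrix $\mathbf{G} = [\mathbf{g}_1, \dots, \mathbf{g}_K] \in \mathbb{C}^{M\times K}$, channel matrix $\mathbf{H} \in \mathbb{C}^{K\times M}$
and noise vector $\mathbf{n} = [n_1, \dots, n_K]^T \sim \mathcal{CN}(0, \sigma^2\mathbf{I}_K)$. Assuming
a total transmit power $P > 0$,

$$\mathrm{tr}\left(E[\mathbf{G}\mathbf{s}\mathbf{s}^H\mathbf{G}^H]\right) = \mathrm{tr}(\mathbf{G}\mathbf{G}^H) \le P. \tag{11}$$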

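We define the SNR $\rho$ as $\rho = P/\sigma^2$. The received symbol
$y_k$ of user $k$ is given by

$$y_k = \sqrt{M}\,\mathbf{h}_k^H\mathbf{g}_k s_k + \sqrt{M}\sum_{i=1, i\neq k}^{K}\mathbf{h}_k^H\mathbf{g}_i s_i + n_k, \tag{12}$$

where $\mathbf{h}_k^H \in \mathbb{C}^{1\times M}$ denotes the $k$th row of $\mathbf{H}$. The SINR $\gamma_k$ of
user $k$ reads

$$\gamma_k = \frac{|\mathbf{h}_k^H\mathbf{g}_k|^2}{\sum_{i=1, i\neq k}^{K}|\mathbf{h}_k^H\mathbf{g}_i|^2 + \frac{\sigma^2}{M}}. \tag{13}$$

The system sum rate $R_{\mathrm{sum}}$ is defined as

$$R_{\mathrm{sum}} = \sum_{k=1}^{K}\log_2(1 + \gamma_k) \quad \text{[bits/s/Hz]}. \tag{14}$$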
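
For simulations, the per-user SINR and the sum rate follow directly from the model (10): the signal and interference powers both carry the factor $M$, so the noise variance appears divided by $M$. The sketch below assumes NumPy; the function name `sinr_and_sum_rate` is hypothetical.

```python
import numpy as np

def sinr_and_sum_rate(H, G, sigma2):
    """Per-user SINR and sum rate for y = sqrt(M) H G s + n.

    H: K x M channel matrix (row k is h_k^H), G: M x K precoder,
    sigma2: noise variance. Returns (gamma, sum rate in bits/s/Hz).
    """
    M = H.shape[1]
    power = np.abs(H @ G) ** 2            # power[k, i] = |h_k^H g_i|^2
    signal = np.diag(power)
    interference = power.sum(axis=1) - signal
    gamma = signal / (interference + sigma2 / M)
    return gamma, float(np.sum(np.log2(1.0 + gamma)))
```

With $\mathbf{H} = \mathbf{G} = \mathbf{I}_2$ and $\sigma^2 = 2$ there is no interference and $\sigma^2/M = 1$, so each user has $\gamma_k = 1$ and the sum rate is 2 bits/s/Hz.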



B. Channel Model 

Under the assumption of a rich scattering environment, the
correlated channel can be modeled as [27]-[29]

$$\mathbf{H} = \mathbf{L}^{1/2}\mathbf{X}\boldsymbol{\Theta}^{1/2}, \tag{15}$$

where $\mathbf{X} \in \mathbb{C}^{K\times M}$ has i.i.d. zero-mean entries of variance
$1/M$, $\boldsymbol{\Theta} \in \mathbb{C}^{M\times M}$ is the nonnegative definite correlation
matrix at the transmitter with eigenvalues $\lambda_1, \dots, \lambda_M$, ordered
as $\lambda_1 \le \dots \le \lambda_M$, and $\mathbf{L} = \mathrm{diag}(l_1, \dots, l_K)$, with entries
ordered as $l_1 \le \dots \le l_K$, contains the user channel gains, i.e.
the inverse user path losses. Note that in (15) the entries of $\mathbf{X}$
are not required to be Gaussian. We further assume that there
exist $a_l, a_u > 0$ such that

$$a_l < \lambda_1 \le \lambda_M < a_u, \tag{16}$$

$$a_l < l_1 \le l_K < a_u \tag{17}$$


uniformly on M and K. That is, (16) assumes that the 
correlation between transmit antennas does not increase as the 
number of antennas increases. For practical finite dimensional 
systems, this is equivalent to requesting that neighboring 
antennas are spaced sufficiently apart. Equation (17) assumes 
that the users are not too close to the base station but not too 
far away either; this is a realistic assumption, as distant users
would be served by neighboring base stations. Those requirements,
although rather realistic, are obviously not mandatory
for practical systems; however, they are required for the
mathematical derivations of the present article. 

Note that field measurements [30] suggest that a user- 
invariant correlation matrix is not a fully realistic as- 
sumption. Signal correlation at the transmitter does not only 
arise from close antenna spacing but also from the channel 
diversity and more specifically from the distribution of the 
solid angles of departure of effectively received energy. It
could be argued though, that the scenario where all users 
experience equal transmit covariance matrices represents a 
worst case scenario, as it reduces multi-user diversity. If not 
fully realistic, the current assumption on $\boldsymbol{\Theta}$ is therefore still
an interesting hypothesis. Further note that (15) assumes that 
the receivers are spaced sufficiently apart and are therefore 
spatially uncorrelated, an assumption which could also be 
argued against in some specific scenarios. 

Besides, we suppose that only $\hat{\mathbf{H}}$, an imperfect estimate of
the true channel matrix $\mathbf{H}$, is available at the transmitter. The
channel gain matrix $\mathbf{L}$ as well as the transmit correlation $\boldsymbol{\Theta}$
are assumed to be slowly varying compared to the channel
coherence time and are assumed to be perfectly known to the
transmitter. We model $\hat{\mathbf{H}}$ as

$$\hat{\mathbf{H}} = \mathbf{L}^{1/2}\hat{\mathbf{X}}\boldsymbol{\Theta}^{1/2} \tag{18}$$

with imperfect short-term statistics $\hat{\mathbf{X}}$ of the form

$$\hat{\mathbf{X}} = \mathrm{diag}\left(\sqrt{1-\tau_1^2}, \dots, \sqrt{1-\tau_K^2}\right)\mathbf{X} + \mathrm{diag}(\tau_1, \dots, \tau_K)\,\mathbf{Q}, \tag{19}$$

where $\mathbf{Q} \in \mathbb{C}^{K\times M}$ is the matrix of channel estimation errors
containing i.i.d. entries of zero mean and variance $1/M$, and
$\tau_k \in [0, 1]$. The parameters $\tau_k$ reflect the amount of distortion
in the channel estimate $\hat{\mathbf{h}}_k$ of user $k$. We assume that the
$\tau_k$ are perfectly known at the transmitter. However, as shown
in [31], an approximate knowledge of $\tau_k$ will not lead to a
severe performance degradation of the system. Furthermore,
we suppose that $\mathbf{X}$ and $\mathbf{Q}$ are mutually independent as well
as independent of the symbol vector s and noise vector n. A 
similar model for the imperfect CSIT has been used in [31]- 
[33]. 

Note that the assumption of not necessarily Gaussian entries 
of channel matrix X can be useful in characterizing the aspect 
of quantization of the channel estimate, which, in practice, is 
not necessarily Gaussian. 
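
The channel model (15) and the imperfect-CSIT model (18)-(19) can be sampled as follows. This is a minimal sketch assuming NumPy and Gaussian entries for $\mathbf{X}$ and $\mathbf{Q}$ (the model itself does not require Gaussianity); the function name `draw_channels` and its arguments are hypothetical.

```python
import numpy as np

def draw_channels(l, Theta, tau, rng):
    """Draw (H, H_hat) following H = L^{1/2} X Theta^{1/2} and (18)-(19).

    l:     length-K user channel gains (diagonal of L).
    Theta: M x M Hermitian nonnegative definite transmit correlation.
    tau:   length-K distortion parameters in [0, 1].
    rng:   numpy.random.Generator.
    """
    l = np.asarray(l, dtype=float)
    tau = np.asarray(tau, dtype=float)
    K, M = len(l), Theta.shape[0]

    def cn(shape):
        # i.i.d. complex Gaussian entries, zero mean, variance 1/M (assumption)
        return (rng.standard_normal(shape)
                + 1j * rng.standard_normal(shape)) / np.sqrt(2 * M)

    X, Q = cn((K, M)), cn((K, M))
    X_hat = np.sqrt(1.0 - tau**2)[:, None] * X + tau[:, None] * Q

    # Hermitian square root of Theta via its eigendecomposition
    w, V = np.linalg.eigh(Theta)
    Theta_half = (V * np.sqrt(np.clip(w, 0.0, None))) @ V.conj().T

    L_half = np.sqrt(l)[:, None]
    return L_half * (X @ Theta_half), L_half * (X_hat @ Theta_half)
```

Setting all $\tau_k = 0$ returns $\hat{\mathbf{H}} = \mathbf{H}$ (perfect CSIT), while $\tau_k = 1$ makes the estimate independent of the true channel.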

IV. A Deterministic Equivalent of the SINR 

In the following we derive a deterministic equivalent $\gamma_k^\circ$ of
the SINR $\gamma_k$ of user $k$. That is, $\gamma_k^\circ$ is an approximation of $\gamma_k$
independent of the particular realizations of $\mathbf{X}$, $\mathbf{Q}$, and is such
that

$$\gamma_k - \gamma_k^\circ \xrightarrow{M\to\infty} 0, \tag{20}$$

almost surely. We proceed by deriving $\gamma^\circ_{k,\mathrm{rzf}}$ for RZF precoding
with regularization parameter $\alpha$ and subsequently let
$\alpha \to 0$ to obtain $\gamma^\circ_{k,\mathrm{zf}}$ for ZF precoders.

Several known results of random matrix theory (RMT) that
we apply during the derivations are recalled in Appendix IV.


A. Regularized Zero-forcing Precoding 
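Consider the RZF precoding matrix

$$\mathbf{G}_{\mathrm{rzf}} = \xi\left(M\hat{\mathbf{H}}^H\hat{\mathbf{H}} + M\alpha\mathbf{I}_M\right)^{-1}\sqrt{M}\,\hat{\mathbf{H}}^H, \tag{21}$$

where we recall that $\hat{\mathbf{H}}$ is the estimated channel matrix
available at the transmitter and the scaling factor $\xi$ is set to
fulfill the power constraint (11). The regularization scalar $\alpha > 0$
in (21) is scaled by $M$ to ensure that, as $(K, M)$ grow large,
both $\mathrm{tr}\,M\hat{\mathbf{H}}^H\hat{\mathbf{H}}$ and $\mathrm{tr}\,M\alpha\mathbf{I}_M$ grow with the same order of
magnitude.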
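
A regularized zero-forcing precoder that meets the power constraint (11) with equality can be built as below. The sketch assumes NumPy and the scaling convention $\mathbf{G} = \frac{\xi}{\sqrt{M}}(\hat{\mathbf{H}}^H\hat{\mathbf{H}} + \alpha\mathbf{I}_M)^{-1}\hat{\mathbf{H}}^H$, which matches the $\sqrt{M}$ factor of the transmission model (10); the function name `rzf_precoder` is hypothetical.

```python
import numpy as np

def rzf_precoder(H_hat, alpha, P):
    """RZF precoder scaled so that tr(G G^H) = P.

    H_hat: K x M channel estimate, alpha > 0: regularization, P: power.
    """
    K, M = H_hat.shape
    W = np.linalg.inv(H_hat.conj().T @ H_hat + alpha * np.eye(M))
    G = W @ H_hat.conj().T / np.sqrt(M)               # precoder with xi = 1
    xi = np.sqrt(P / np.trace(G @ G.conj().T).real)   # power normalization
    return xi * G
```

The returned matrix has size $M \times K$; dividing the achieved $\mathrm{tr}(\mathbf{G}\mathbf{G}^H)$ by $P$ gives 1 up to rounding.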

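Theorem 2: Let $\gamma_{k,\mathrm{rzf}}$ be the SINR of user $k$ for RZF
precoding. Then

$$\gamma_{k,\mathrm{rzf}} - \gamma^\circ_{k,\mathrm{rzf}} \xrightarrow{M\to\infty} 0, \tag{22}$$

almost surely, where $\gamma^\circ_{k,\mathrm{rzf}}$ is given by

$$\gamma^\circ_{k,\mathrm{rzf}} = \frac{l_k^2(1-\tau_k^2)(m^\circ)^2}{l_k\Upsilon^\circ\left(1-\tau_k^2\left[1-(1+l_k m^\circ)^2\right]\right) + \frac{\Psi^\circ}{\rho}(1+l_k m^\circ)^2} \tag{23}$$

with

$$m^\circ = \mathrm{Tr}\,\boldsymbol{\Theta}\left(\alpha\mathbf{I}_M + c\boldsymbol{\Theta}\right)^{-1}, \tag{24}$$

$$\Psi^\circ(\alpha) = c\,\mathrm{Tr}\,\boldsymbol{\Theta}\left(\alpha\mathbf{I}_M + c\boldsymbol{\Theta}\right)^{-2} - \frac{\frac{\alpha}{\beta}\mathrm{Tr}\,\mathbf{L}^2\left(\mathbf{I}_K + m^\circ\mathbf{L}\right)^{-2}\left[\mathrm{Tr}\,\boldsymbol{\Theta}\left(\alpha\mathbf{I}_M + c\boldsymbol{\Theta}\right)^{-2}\right]^2}{1 - \frac{1}{\beta}\mathrm{Tr}\,\mathbf{L}^2\left(\mathbf{I}_K + m^\circ\mathbf{L}\right)^{-2}\mathrm{Tr}\,\boldsymbol{\Theta}^2\left(\alpha\mathbf{I}_M + c\boldsymbol{\Theta}\right)^{-2}}, \tag{25}$$

$$\Upsilon^\circ = m^\circ - \frac{\alpha\,\mathrm{Tr}\,\boldsymbol{\Theta}\left(\alpha\mathbf{I}_M + c\boldsymbol{\Theta}\right)^{-2}}{1 - \frac{1}{\beta}\mathrm{Tr}\,\mathbf{L}^2\left(\mathbf{I}_K + m^\circ\mathbf{L}\right)^{-2}\mathrm{Tr}\,\boldsymbol{\Theta}^2\left(\alpha\mathbf{I}_M + c\boldsymbol{\Theta}\right)^{-2}}, \tag{26}$$

where $c$ is the unique real positive solution of

$$c = \frac{1}{\beta}\mathrm{Tr}\,\mathbf{L}\left(\mathbf{I}_K + \mathbf{L}\,\mathrm{Tr}\,\boldsymbol{\Theta}\left(\alpha\mathbf{I}_M + c\boldsymbol{\Theta}\right)^{-1}\right)^{-1}. \tag{27}$$

Moreover, define $m_0 = 1/\alpha$, and for $k \ge 1$

$$c_k = \frac{1}{\beta}\mathrm{Tr}\,\mathbf{L}\left(\mathbf{I}_K + m_{k-1}\mathbf{L}\right)^{-1}, \tag{28}$$

$$m_k = \mathrm{Tr}\,\boldsymbol{\Theta}\left(\alpha\mathbf{I}_M + c_k\boldsymbol{\Theta}\right)^{-1}. \tag{29}$$

Then $c = \lim_{k\to\infty} c_k$.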
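
Because every quantity in Theorem 2 depends on $\boldsymbol{\Theta}$ and $\mathbf{L}$ only through normalized traces, the deterministic equivalent can be evaluated from the eigenvalues $\lambda_i$ and the gains $l_k$ alone. The NumPy sketch below is an illustration under stated assumptions: the function name is hypothetical, a fixed number of iterations of the recursion $m_0 = 1/\alpha$, $c_k$, $m_k$ replaces a proper stopping rule, and $\Psi^\circ$ is written in the form obtained by differentiating $\mathrm{Tr}(\alpha\mathbf{I}_M + c\boldsymbol{\Theta})^{-1}$ with respect to the regularization.

```python
import numpy as np

def rzf_deterministic_sinr(l, lam, tau, alpha, rho, n_iter=500):
    """Deterministic-equivalent SINR of RZF precoding.

    l:   length-K channel gains, lam: length-M eigenvalues of Theta,
    tau: scalar or length-K distortions, alpha > 0, rho: SNR.
    Returns (gamma, m, psi, ups) with gamma of length K.
    """
    l, lam, tau = (np.asarray(a, dtype=float) for a in (l, lam, tau))
    beta = len(lam) / len(l)                      # beta = M / K
    m = 1.0 / alpha                               # m_0
    for _ in range(n_iter):
        c = np.mean(l / (1.0 + m * l)) / beta     # c_k from m_{k-1}
        m = np.mean(lam / (alpha + c * lam))      # m_k from c_k
    c = np.mean(l / (1.0 + m * l)) / beta
    a = np.mean(l**2 / (1.0 + m * l) ** 2) / beta
    t1 = np.mean(lam / (alpha + c * lam) ** 2)    # Tr Theta (alpha I + c Theta)^-2
    t2 = np.mean(lam**2 / (alpha + c * lam) ** 2) # Tr Theta^2 (alpha I + c Theta)^-2
    psi = c * t1 - alpha * a * t1**2 / (1.0 - a * t2)
    ups = m - alpha * t1 / (1.0 - a * t2)
    num = l**2 * (1.0 - tau**2) * m**2
    den = (l * ups * (1.0 - tau**2 * (1.0 - (1.0 + l * m) ** 2))
           + psi / rho * (1.0 + l * m) ** 2)
    return num / den, m, psi, ups
```

For $\boldsymbol{\Theta} = \mathbf{I}_M$, $\mathbf{L} = \mathbf{I}_K$, $\beta = 2$ and $\alpha = 1/2$ the fixed point is $m^\circ = \sqrt{2}$, and $\Psi^\circ$ and $\Upsilon^\circ$ coincide, which gives a quick consistency check of an implementation.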

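Proof: From the sum power constraint (11) we obtain

$$\xi^2 = \frac{P}{\frac{1}{M}\mathrm{tr}\,\hat{\mathbf{H}}^H\hat{\mathbf{H}}\left(\hat{\mathbf{H}}^H\hat{\mathbf{H}} + \alpha\mathbf{I}_M\right)^{-2}} \tag{30}$$

$$\overset{(a)}{=} \frac{P}{m_{\hat{\mathbf{H}}^H\hat{\mathbf{H}}}(-\alpha) - \alpha m'_{\hat{\mathbf{H}}^H\hat{\mathbf{H}}}(-\alpha)} = \frac{P}{\Psi(\alpha)}, \tag{31}$$

where (a) follows from the decomposition $\hat{\mathbf{H}}^H\hat{\mathbf{H}}(\hat{\mathbf{H}}^H\hat{\mathbf{H}} + \alpha\mathbf{I}_M)^{-2} = (\hat{\mathbf{H}}^H\hat{\mathbf{H}} + \alpha\mathbf{I}_M)^{-1} - \alpha(\hat{\mathbf{H}}^H\hat{\mathbf{H}} + \alpha\mathbf{I}_M)^{-2}$, and we
define

$$\Psi(\alpha) = m_{\hat{\mathbf{H}}^H\hat{\mathbf{H}}}(-\alpha) - \alpha m'_{\hat{\mathbf{H}}^H\hat{\mathbf{H}}}(-\alpha) \tag{32}$$

with $m'_{\hat{\mathbf{H}}^H\hat{\mathbf{H}}}(-\alpha)$ the derivative of $m_{\hat{\mathbf{H}}^H\hat{\mathbf{H}}}(z)$ w.r.t. $z$ in $z = -\alpha$.
The received symbol $y_k$ of user $k$ is given by

$$y_k = \xi\,\mathbf{h}_k^H\hat{\mathbf{W}}\hat{\mathbf{h}}_k s_k + \xi\sum_{i=1, i\neq k}^{K}\mathbf{h}_k^H\hat{\mathbf{W}}\hat{\mathbf{h}}_i s_i + n_k, \tag{33}$$

where $\hat{\mathbf{W}} = (\hat{\mathbf{H}}^H\hat{\mathbf{H}} + \alpha\mathbf{I}_M)^{-1}$ and $\hat{\mathbf{h}}_k^H$ denotes the $k$th row of
$\hat{\mathbf{H}}$. The SINR $\gamma_{k,\mathrm{rzf}}$ of user $k$ can be written in the form

$$\gamma_{k,\mathrm{rzf}} = \frac{|\mathbf{h}_k^H\hat{\mathbf{W}}\hat{\mathbf{h}}_k|^2}{\mathbf{h}_k^H\hat{\mathbf{W}}\hat{\mathbf{H}}_{[k]}^H\hat{\mathbf{H}}_{[k]}\hat{\mathbf{W}}\mathbf{h}_k + \frac{1}{\rho}\Psi(\alpha)}, \tag{34}$$

where $\hat{\mathbf{H}}_{[k]}^H = [\hat{\mathbf{h}}_1, \dots, \hat{\mathbf{h}}_{k-1}, \hat{\mathbf{h}}_{k+1}, \dots, \hat{\mathbf{h}}_K] \in \mathbb{C}^{M\times(K-1)}$.

We will proceed by successively deriving deterministic
equivalent expressions for $\Psi(\alpha)$, for the signal
power $|\mathbf{h}_k^H\hat{\mathbf{W}}\hat{\mathbf{h}}_k|^2$ and for the power of the interference
$\mathbf{h}_k^H\hat{\mathbf{W}}\hat{\mathbf{H}}_{[k]}^H\hat{\mathbf{H}}_{[k]}\hat{\mathbf{W}}\mathbf{h}_k$.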
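Consider $\Psi(\alpha)$ in (32). From Theorem 1, $m_{\hat{\mathbf{H}}^H\hat{\mathbf{H}}}(-\alpha)$ is
close to $m^\circ_{\hat{\mathbf{H}}^H\hat{\mathbf{H}}}(-\alpha)$ given by (4) as

$$m^\circ_{\hat{\mathbf{H}}^H\hat{\mathbf{H}}}(-\alpha) = \mathrm{Tr}\left(\alpha\mathbf{I}_M + c\boldsymbol{\Theta}\right)^{-1}, \tag{35}$$

where $c$ is defined in (27) in terms of $m^\circ$ from (24).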

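Remark 2: Note that $m^\circ$ in (24) is a deterministic equivalent
of the Stieltjes transform $m_{\mathbf{A}}(z)$ of

$$\mathbf{A} = \hat{\mathbf{X}}^H\mathbf{L}\hat{\mathbf{X}} + \alpha\boldsymbol{\Theta}^{-1} \tag{36}$$

evaluated at $z = 0$, i.e. almost surely

$$m_{\mathbf{A}}(0) - m^\circ \xrightarrow{M\to\infty} 0. \tag{37}$$

It is uncommon to consider Stieltjes transforms evaluated
at $z = 0$. However, in our case this is valid, because we
assumed in (16) that $1/\lambda_M > 1/a_u$ and then the smallest
eigenvalue of $\mathbf{A}$ is strictly greater than $\alpha/(2a_u) > 0$, uniformly
on $M$. Therefore, $m_{\mathbf{A}}(0)$ is well defined. Now $m_{\mathbf{A}}(0) =
m_{\mathbf{A} - \frac{\alpha}{2a_u}\mathbf{I}_M}\left(-\frac{\alpha}{2a_u}\right)$. Since $\mathbf{A} - \frac{\alpha}{2a_u}\mathbf{I}_M$ meets the
conditions of Theorem 1 with $\mathbf{S}_N = \alpha\boldsymbol{\Theta}^{-1} - \frac{\alpha}{2a_u}\mathbf{I}_M$,
$\mathbf{T}_N = \mathbf{L}$ and $\mathbf{R}_N = \mathbf{Q}_N = \mathbf{I}_M$, one can determine a
deterministic equivalent for $m_{\mathbf{A}}(0)$, which is then $m^\circ$.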

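Since the deterministic equivalent for the Stieltjes transform
of $\hat{\mathbf{H}}^H\hat{\mathbf{H}}$ is itself the Stieltjes transform of a probability
distribution (cf. [34]), a trivial application of the dominated
convergence theorem ensures that the derivative $m^{\circ\prime}_{\hat{\mathbf{H}}^H\hat{\mathbf{H}}}(z)$ of
$m^\circ_{\hat{\mathbf{H}}^H\hat{\mathbf{H}}}(z)$ is a deterministic equivalent for $m'_{\hat{\mathbf{H}}^H\hat{\mathbf{H}}}(z)$, i.e.

$$m'_{\hat{\mathbf{H}}^H\hat{\mathbf{H}}}(z) - m^{\circ\prime}_{\hat{\mathbf{H}}^H\hat{\mathbf{H}}}(z) \xrightarrow{M\to\infty} 0, \tag{38}$$

almost surely.

After differentiation of (35) and standard algebraic manipulations,
we obtain

$$\Psi(\alpha) - \Psi^\circ(\alpha) \xrightarrow{M\to\infty} 0, \tag{39}$$

almost surely, where

$$\Psi^\circ(\alpha) = m^\circ_{\hat{\mathbf{H}}^H\hat{\mathbf{H}}}(-\alpha) - \alpha m^{\circ\prime}_{\hat{\mathbf{H}}^H\hat{\mathbf{H}}}(-\alpha), \tag{40}$$

which is explicitly given by (25).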

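1) A Deterministic Equivalent Of The Signal Power: Applying
Lemma 4 to $\hat{\mathbf{h}}_k^H\hat{\mathbf{W}} = \hat{\mathbf{h}}_k^H(\hat{\mathbf{H}}_{[k]}^H\hat{\mathbf{H}}_{[k]} + \alpha\mathbf{I}_M + \hat{\mathbf{h}}_k\hat{\mathbf{h}}_k^H)^{-1}$,
we have

$$\mathbf{h}_k^H\hat{\mathbf{W}}\hat{\mathbf{h}}_k = \frac{\mathbf{h}_k^H\left(\hat{\mathbf{H}}_{[k]}^H\hat{\mathbf{H}}_{[k]} + \alpha\mathbf{I}_M\right)^{-1}\hat{\mathbf{h}}_k}{1 + \hat{\mathbf{h}}_k^H\left(\hat{\mathbf{H}}_{[k]}^H\hat{\mathbf{H}}_{[k]} + \alpha\mathbf{I}_M\right)^{-1}\hat{\mathbf{h}}_k}. \tag{41}$$

Together with $\hat{\mathbf{h}}_k^H = \sqrt{l_k}\left(\sqrt{1-\tau_k^2}\,\mathbf{x}_k^H + \tau_k\mathbf{q}_k^H\right)\boldsymbol{\Theta}^{1/2}$ we obtain

$$\mathbf{h}_k^H\hat{\mathbf{W}}\hat{\mathbf{h}}_k = \frac{\sqrt{1-\tau_k^2}\,l_k\mathbf{x}_k^H\mathbf{A}_{[k]}^{-1}\mathbf{x}_k + \tau_k l_k\mathbf{x}_k^H\mathbf{A}_{[k]}^{-1}\mathbf{q}_k}{1 + l_k\hat{\mathbf{x}}_k^H\mathbf{A}_{[k]}^{-1}\hat{\mathbf{x}}_k}$$

with $\mathbf{A}_{[k]} = \hat{\mathbf{X}}_{[k]}^H\mathbf{L}_{[k]}\hat{\mathbf{X}}_{[k]} + \alpha\boldsymbol{\Theta}^{-1}$ for $\hat{\mathbf{X}}_{[k]}^H =
[\hat{\mathbf{x}}_1, \dots, \hat{\mathbf{x}}_{k-1}, \hat{\mathbf{x}}_{k+1}, \dots, \hat{\mathbf{x}}_K]$, $\hat{\mathbf{x}}_n^H$ being the $n$th row of
$\hat{\mathbf{X}}$, and $\mathbf{L}_{[k]} = \mathrm{diag}(l_1, \dots, l_{k-1}, l_{k+1}, \dots, l_K)$. Since both
$\mathbf{x}_k$ and $\hat{\mathbf{x}}_k$ have i.i.d. entries of variance $1/M$ and are
independent of $\mathbf{A}_{[k]}$ we evoke Corollary 7 and obtain, almost
surely,

$$\mathbf{x}_k^H\mathbf{A}_{[k]}^{-1}\mathbf{x}_k - \frac{1}{M}\mathrm{tr}\,\mathbf{A}_{[k]}^{-1} \xrightarrow{M\to\infty} 0, \tag{42}$$

$$\hat{\mathbf{x}}_k^H\mathbf{A}_{[k]}^{-1}\hat{\mathbf{x}}_k - \frac{1}{M}\mathrm{tr}\,\mathbf{A}_{[k]}^{-1} \xrightarrow{M\to\infty} 0. \tag{43}$$

Similarly, as $\mathbf{q}_k$ and $\mathbf{x}_k$ are independent, it follows from
Lemma 6 that

$$\mathbf{q}_k^H\mathbf{A}_{[k]}^{-1}\mathbf{x}_k \xrightarrow{M\to\infty} 0, \tag{44}$$

almost surely. Consequently, since $(1 + l_k\hat{\mathbf{x}}_k^H\mathbf{A}_{[k]}^{-1}\hat{\mathbf{x}}_k)$ is
bounded away from zero, we obtain

$$\mathbf{h}_k^H\hat{\mathbf{W}}\hat{\mathbf{h}}_k - \sqrt{1-\tau_k^2}\,\frac{l_k\frac{1}{M}\mathrm{tr}\,\mathbf{A}_{[k]}^{-1}}{1 + l_k\frac{1}{M}\mathrm{tr}\,\mathbf{A}_{[k]}^{-1}} \xrightarrow{M\to\infty} 0, \tag{45}$$

almost surely.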

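We require Lemma 7 to prove that a rank-1 perturbation
of $\mathbf{A}$ has no impact on $\mathrm{Tr}\,\mathbf{A}^{-1}$ for asymptotically large $M$.
Rewrite $\mathbf{A}_{[k]}^{-1}$ as

$$\mathbf{A}_{[k]}^{-1} = \left(\left[\hat{\mathbf{X}}_{[k]}^H\mathbf{L}_{[k]}\hat{\mathbf{X}}_{[k]} + \alpha\boldsymbol{\Theta}^{-1} - \frac{\alpha}{2a_u}\mathbf{I}_M\right] + \frac{\alpha}{2a_u}\mathbf{I}_M\right)^{-1}. \tag{46}$$

Since $1/\lambda_M > 1/a_u$ uniformly on $M$, notice that the matrix
in brackets on the right-hand side is still nonnegative definite.
Thus, applying Lemma 7 to this matrix and the scalar $\frac{\alpha}{2a_u} > 0$,
we obtain

$$\frac{1}{M}\mathrm{tr}\left(\hat{\mathbf{X}}_{[k]}^H\mathbf{L}_{[k]}\hat{\mathbf{X}}_{[k]} + \alpha\boldsymbol{\Theta}^{-1}\right)^{-1} - \frac{1}{M}\mathrm{tr}\left(\hat{\mathbf{X}}_{[k]}^H\mathbf{L}_{[k]}\hat{\mathbf{X}}_{[k]} + l_k\hat{\mathbf{x}}_k\hat{\mathbf{x}}_k^H + \alpha\boldsymbol{\Theta}^{-1}\right)^{-1} \xrightarrow{M\to\infty} 0.$$

Therefore, surely,

$$\frac{1}{M}\mathrm{tr}\,\mathbf{A}_{[k]}^{-1} - \frac{1}{M}\mathrm{tr}\,\mathbf{A}^{-1} \xrightarrow{M\to\infty} 0, \tag{47}$$

where we recall that $\mathbf{A} = \hat{\mathbf{X}}^H\mathbf{L}\hat{\mathbf{X}} + \alpha\boldsymbol{\Theta}^{-1}$.
Thus, (47) and (37) imply

$$\frac{1}{M}\mathrm{tr}\,\mathbf{A}_{[k]}^{-1} - m^\circ \xrightarrow{M\to\infty} 0, \tag{48}$$

almost surely. Finally, (45) takes the form

$$\mathbf{h}_k^H\hat{\mathbf{W}}\hat{\mathbf{h}}_k - \sqrt{1-\tau_k^2}\,\frac{l_k m^\circ}{1 + l_k m^\circ} \xrightarrow{M\to\infty} 0. \tag{49}$$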



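2) A Deterministic Equivalent Of The Interference Power:
With $\hat{\mathbf{W}} = \boldsymbol{\Theta}^{-1/2}\mathbf{A}^{-1}\boldsymbol{\Theta}^{-1/2}$, the interference power can be
written as

$$\mathbf{h}_k^H\hat{\mathbf{W}}\hat{\mathbf{H}}_{[k]}^H\hat{\mathbf{H}}_{[k]}\hat{\mathbf{W}}\mathbf{h}_k = l_k\mathbf{x}_k^H\mathbf{A}^{-1}\hat{\mathbf{X}}_{[k]}^H\mathbf{L}_{[k]}\hat{\mathbf{X}}_{[k]}\mathbf{A}^{-1}\mathbf{x}_k. \tag{50}$$

Denote $c_0 = (1-\tau_k^2)l_k$, $c_1 = \tau_k^2 l_k$ and $c_2 = \tau_k\sqrt{1-\tau_k^2}\,l_k$, then

$$\mathbf{A} = \mathbf{A}_{[k]} + c_0\mathbf{x}_k\mathbf{x}_k^H + c_1\mathbf{q}_k\mathbf{q}_k^H + c_2\mathbf{x}_k\mathbf{q}_k^H + c_2\mathbf{q}_k\mathbf{x}_k^H. \tag{51}$$

In order to eliminate the dependence between $\mathbf{x}_k$ and $\mathbf{A}$ in
(50), we rewrite (50) as

$$\mathbf{h}_k^H\hat{\mathbf{W}}\hat{\mathbf{H}}_{[k]}^H\hat{\mathbf{H}}_{[k]}\hat{\mathbf{W}}\mathbf{h}_k = l_k\mathbf{x}_k^H\mathbf{A}_{[k]}^{-1}\hat{\mathbf{X}}_{[k]}^H\mathbf{L}_{[k]}\hat{\mathbf{X}}_{[k]}\mathbf{A}^{-1}\mathbf{x}_k + l_k\mathbf{x}_k^H\left[\mathbf{A}^{-1} - \mathbf{A}_{[k]}^{-1}\right]\hat{\mathbf{X}}_{[k]}^H\mathbf{L}_{[k]}\hat{\mathbf{X}}_{[k]}\mathbf{A}^{-1}\mathbf{x}_k. \tag{52}$$

Applying Lemma 8 to the term in brackets in (52) and together
with $\mathbf{A}_{[k]}^{-1}\hat{\mathbf{X}}_{[k]}^H\mathbf{L}_{[k]}\hat{\mathbf{X}}_{[k]} = \mathbf{I}_M - \alpha\mathbf{A}_{[k]}^{-1}\boldsymbol{\Theta}^{-1}$, equation (52)
takes the form

$$\begin{aligned}
\mathbf{h}_k^H\hat{\mathbf{W}}\hat{\mathbf{H}}_{[k]}^H\hat{\mathbf{H}}_{[k]}\hat{\mathbf{W}}\mathbf{h}_k ={}& l_k\mathbf{x}_k^H\mathbf{A}^{-1}\mathbf{x}_k - \alpha l_k\mathbf{x}_k^H\mathbf{A}_{[k]}^{-1}\boldsymbol{\Theta}^{-1}\mathbf{A}^{-1}\mathbf{x}_k \\
&- c_0 l_k\mathbf{x}_k^H\mathbf{A}^{-1}\mathbf{x}_k\left(\mathbf{x}_k^H\mathbf{A}^{-1}\mathbf{x}_k - \alpha\mathbf{x}_k^H\mathbf{A}_{[k]}^{-1}\boldsymbol{\Theta}^{-1}\mathbf{A}^{-1}\mathbf{x}_k\right) \\
&- c_1 l_k\mathbf{x}_k^H\mathbf{A}^{-1}\mathbf{q}_k\left(\mathbf{q}_k^H\mathbf{A}^{-1}\mathbf{x}_k - \alpha\mathbf{q}_k^H\mathbf{A}_{[k]}^{-1}\boldsymbol{\Theta}^{-1}\mathbf{A}^{-1}\mathbf{x}_k\right) \\
&- c_2 l_k\mathbf{x}_k^H\mathbf{A}^{-1}\mathbf{x}_k\left(\mathbf{q}_k^H\mathbf{A}^{-1}\mathbf{x}_k - \alpha\mathbf{q}_k^H\mathbf{A}_{[k]}^{-1}\boldsymbol{\Theta}^{-1}\mathbf{A}^{-1}\mathbf{x}_k\right) \\
&- c_2 l_k\mathbf{x}_k^H\mathbf{A}^{-1}\mathbf{q}_k\left(\mathbf{x}_k^H\mathbf{A}^{-1}\mathbf{x}_k - \alpha\mathbf{x}_k^H\mathbf{A}_{[k]}^{-1}\boldsymbol{\Theta}^{-1}\mathbf{A}^{-1}\mathbf{x}_k\right).
\end{aligned} \tag{53}$$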

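To find a deterministic equivalent for all of the 14 quadratic
terms in (53) we need the following lemma, which is an
extension of Corollary 7.

Lemma 1: Let $\mathbf{U}, \mathbf{V} \in \mathbb{C}^{N\times N}$ be invertible and of uniformly
bounded spectral norm. Let $\mathbf{x}, \mathbf{y} \in \mathbb{C}^N$ have i.i.d. complex
entries of zero mean, variance $1/N$ and finite 8th order
moment and be mutually independent as well as independent
of $\mathbf{U}, \mathbf{V}$. Define $c_0, c_1, c_2 \in \mathbb{R}^+$ such that $c_0c_1 - c_2^2 \ge 0$ and
let $u = \frac{1}{N}\mathrm{tr}\,\mathbf{V}^{-1}$ and $u' = \frac{1}{N}\mathrm{tr}\,\mathbf{U}\mathbf{V}^{-1}$. Then we have

$$\mathbf{x}^H\mathbf{U}\left(\mathbf{V} + c_0\mathbf{x}\mathbf{x}^H + c_1\mathbf{y}\mathbf{y}^H + c_2\mathbf{x}\mathbf{y}^H + c_2\mathbf{y}\mathbf{x}^H\right)^{-1}\mathbf{x} - \frac{u'(1 + c_1u)}{(c_0c_1 - c_2^2)u^2 + (c_0 + c_1)u + 1} \xrightarrow{N\to\infty} 0, \tag{54}$$

almost surely. Furthermore

$$\mathbf{x}^H\mathbf{U}\left(\mathbf{V} + c_0\mathbf{x}\mathbf{x}^H + c_1\mathbf{y}\mathbf{y}^H + c_2\mathbf{x}\mathbf{y}^H + c_2\mathbf{y}\mathbf{x}^H\right)^{-1}\mathbf{y} - \frac{-c_2uu'}{(c_0c_1 - c_2^2)u^2 + (c_0 + c_1)u + 1} \xrightarrow{N\to\infty} 0, \tag{55}$$

almost surely.

The proof of Lemma 1 is left to Appendix III.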

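Denote $u = \mathrm{Tr}\,\mathbf{A}_{[k]}^{-1}$ and $u' = \mathrm{Tr}\,\boldsymbol{\Theta}^{-1}\mathbf{A}_{[k]}^{-2}$. Notice that in our
case $c_0c_1 = c_2^2$, thus $(c_0c_1 - c_2^2)u^2 + (c_0 + c_1)u + 1$ reduces to
$1 + l_ku$. Applying Lemma 1 to each of the 14 terms in (53)
we obtain

$$\mathbf{h}_k^H\hat{\mathbf{W}}\hat{\mathbf{H}}_{[k]}^H\hat{\mathbf{H}}_{[k]}\hat{\mathbf{W}}\mathbf{h}_k - \left[\frac{l_k(1 + c_1u)(u - \alpha u')}{1 + l_ku}\right] + \left[\frac{l_kc_0u(u - \alpha u')}{(1 + l_ku)^2}\right] \xrightarrow{M\to\infty} 0, \tag{56}$$

almost surely, where the first term in brackets stems from the
first line in (53) and the second term in brackets of (56) arises
from the last four lines of equation (53). Replacing $c_0$ and $c_1$
by $(1-\tau_k^2)l_k$ and $\tau_k^2l_k$, respectively and after some algebraic
manipulation, (56) takes the form

$$\mathbf{h}_k^H\hat{\mathbf{W}}\hat{\mathbf{H}}_{[k]}^H\hat{\mathbf{H}}_{[k]}\hat{\mathbf{W}}\mathbf{h}_k - \frac{l_k(u - \alpha u')\left[1 - \tau_k^2\left(1 - (1 + l_ku)^2\right)\right]}{(1 + l_ku)^2} \xrightarrow{M\to\infty} 0, \tag{57}$$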

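almost surely. Since a rank-1 perturbation has no impact on
$\mathrm{Tr}\,\mathbf{A}^{-1}$ for $M \to \infty$, we surely have

$$u - m_{\mathbf{A}}(0) \xrightarrow{M\to\infty} 0, \tag{58}$$

$$u' - m_{\mathbf{A}^2,\boldsymbol{\Theta}^{-1}}(0) \xrightarrow{M\to\infty} 0. \tag{59}$$

The definition of $m_{\mathbf{A}^2,\boldsymbol{\Theta}^{-1}}(z)$ can be extended to $z = 0$, since
$\mathbf{A}$ and $\mathbf{A}^2$ have their smallest eigenvalue belonging to a compact
set away from zero, uniformly on $M$ (cf. Remark 2). Denote

$$\Upsilon = m_{\mathbf{A}}(0) - \alpha m_{\mathbf{A}^2,\boldsymbol{\Theta}^{-1}}(0). \tag{60}$$

Observe that

$$m_{\mathbf{A}^2,\boldsymbol{\Theta}^{-1}}(0) = m'_{\mathbf{A},\boldsymbol{\Theta}^{-1}}(0). \tag{61}$$

Furthermore, we have $m_{\mathbf{A}^2,\boldsymbol{\Theta}^{-1}}(0) - m^{\circ\prime}_{\mathbf{A},\boldsymbol{\Theta}^{-1}}(0) \xrightarrow{M\to\infty} 0$,
almost surely, where $m^\circ_{\mathbf{A},\boldsymbol{\Theta}^{-1}}(0)$ is given by Theorem 1.
Similar to the derivations of $\Psi^\circ(\alpha)$ leading to (25), we then
obtain that $\Upsilon - \Upsilon^\circ \xrightarrow{M\to\infty} 0$, almost surely, with $\Upsilon^\circ$ given by

$$\Upsilon^\circ = m^\circ_{\mathbf{A}}(0) - \alpha m^{\circ\prime}_{\mathbf{A},\boldsymbol{\Theta}^{-1}}(0), \tag{62}$$

whose explicit form is given by (26). Substituting $u$ and $u'$ in
(57) by their respective deterministic equivalent expressions
$m^\circ$ and $m^{\circ\prime}_{\mathbf{A},\boldsymbol{\Theta}^{-1}}(0)$, we obtain

$$\mathbf{h}_k^H\hat{\mathbf{W}}\hat{\mathbf{H}}_{[k]}^H\hat{\mathbf{H}}_{[k]}\hat{\mathbf{W}}\mathbf{h}_k - \frac{l_k\Upsilon^\circ\left(1 - \tau_k^2\left[1 - (1 + l_km^\circ)^2\right]\right)}{(1 + l_km^\circ)^2} \xrightarrow{M\to\infty} 0, \tag{63}$$

almost surely.

Finally, a deterministic equivalent $\gamma^\circ_{k,\mathrm{rzf}}$ of $\gamma_{k,\mathrm{rzf}}$ is given
by (23), since the denominator of (23) is positive and bounded
away from zero.

Moreover, the convergence of the sequence $\{c_k\}$ to $c$ is a
direct consequence of Proposition 1. ■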

B. Optimal Regularized Zero-forcing Precoding 
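Define $R^\circ_{\mathrm{sum,rzf}}$ to be

$$R^\circ_{\mathrm{sum,rzf}} = \sum_{k=1}^{K}\log_2\left(1 + \gamma^\circ_{k,\mathrm{rzf}}\right) \quad \text{[bits/s/Hz]}, \tag{64}$$

where the parameter $\alpha$ in $\gamma^\circ_{k,\mathrm{rzf}}$ is chosen to be the positive
real that maximizes $R^\circ_{\mathrm{sum,rzf}}$, i.e. $\alpha = \alpha^{\star\circ}$, with $\alpha^{\star\circ}$ defined as

$$\alpha^{\star\circ} = \arg\max_{\alpha > 0}\left\{R^\circ_{\mathrm{sum,rzf}}\right\}. \tag{65}$$

For the general channel model (18), $\alpha^{\star\circ}$ is a solution, s.t.
$\alpha^{\star\circ} > 0$, of the implicit equation

$$\sum_{k=1}^{K}\frac{\partial\gamma^\circ_{k,\mathrm{rzf}}}{\partial\alpha}\,\frac{1}{1 + \gamma^\circ_{k,\mathrm{rzf}}} = 0. \tag{66}$$

The implicit equation (66) is not convex in $\alpha$ and the solution
can be computed via a one-dimensional line search.¹ Note that
for non-i.i.d. channels the RZF in (21) is not asymptotically
optimal anymore.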

The RZF precoder with optimal regularization parameter
$\alpha^{\star\circ}$ is called ORZF. For homogeneous networks ($\mathbf{L} = \mathbf{I}_K$)
the user channels $\mathbf{h}_k$ are statistically equivalent and it is
reasonable to assume that the distortions $\tau_k^2$ of the CSIT $\hat{\mathbf{h}}_k$
are identical for all users, i.e. $\tau = \tau_k$. Under the additional
assumption of uncorrelated transmit antennas ($\boldsymbol{\Theta} = \mathbf{I}_M$),
the solution to (65) has a closed form and leads to the
asymptotically optimal precoder [17].

Proposition 2: Let $\boldsymbol{\Theta} = \mathbf{I}_M$ and $\mathbf{L} = \mathbf{I}_K$. The approximated
SINR $\gamma^\circ_{k,\mathrm{rzf}}$ of user $k$ under RZF precoding (equivalently, the
approximated per-user rate and the sum rate) is maximized for
a regularization term $\alpha = \alpha^{\star\circ}$, given by

$$\alpha^{\star\circ} = \left(\frac{1 + \tau^2\rho}{1 - \tau^2}\right)\frac{1}{\beta\rho}. \tag{67}$$

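Proof: For $\mathbf{L} = \mathbf{I}_K$ and $\boldsymbol{\Theta} = \mathbf{I}_M$, $\Psi^\circ(\alpha) = \Upsilon^\circ$ and $m^\circ =
m^\circ_{\hat{\mathbf{X}}^H\hat{\mathbf{X}}}(-\alpha)$ is the Stieltjes transform of the Marčenko-Pastur
law and reads [24]

$$m^\circ_{\hat{\mathbf{X}}^H\hat{\mathbf{X}}}(-\alpha) = \frac{\beta(1-\alpha) - 1 + d(\alpha, \beta)}{2\alpha\beta},$$

$$\text{with } d(\alpha, \beta) = \sqrt{\beta^2\alpha^2 + 2\alpha\beta(1+\beta) + (1-\beta)^2}. \tag{68}$$

Substituting (68) into (23)-(27) and setting the derivative w.r.t.
$\alpha$ to zero, a real positive solution is given by (67). ■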
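
The closed forms of Proposition 2 are easy to evaluate. The following sketch uses only the Python standard library; the function names `mp_stieltjes` and `alpha_star` are hypothetical, and the formulas are the ones of (68) and (67) for $\boldsymbol{\Theta} = \mathbf{I}_M$, $\mathbf{L} = \mathbf{I}_K$.

```python
import math

def mp_stieltjes(alpha, beta):
    """Value m(-alpha) of (68) for beta = M/K and alpha > 0."""
    d = math.sqrt(beta**2 * alpha**2
                  + 2.0 * alpha * beta * (1.0 + beta)
                  + (1.0 - beta) ** 2)
    return (beta * (1.0 - alpha) - 1.0 + d) / (2.0 * alpha * beta)

def alpha_star(tau, rho, beta):
    """Regularization (67); reduces to 1/(beta*rho) for tau = 0."""
    return (1.0 + tau**2 * rho) / (1.0 - tau**2) / (beta * rho)

if __name__ == "__main__":
    m = mp_stieltjes(0.5, 2.0)
    # m should solve the fixed point m * (alpha + 1/(beta*(1+m))) = 1
    print(m, m * (0.5 + 1.0 / (2.0 * (1.0 + m))))
```

For $\alpha = 1/2$ and $\beta = 2$ the quadratic behind (68) is $m^2 - 2 = 0$, so `mp_stieltjes(0.5, 2.0)` evaluates to $\sqrt{2}$, and the printed fixed-point residual equals 1.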
Notice that for perfect CSIT ($\tau = 0$) we have $\alpha^{\star\circ} = 1/(\beta\rho)$
which corresponds to the RZF-CDU precoder derived in [9],
[18]. As mentioned in [9], for large $(K, M)$ the RZF-CDU
precoder is identical to the MMSE precoder in [16], [31]. In
contrast, for $\tau > 0$, the ORZF transmit filter and the MMSE
transmit filter [31] are not identical anymore, even in the large
$M$ limit. Furthermore, for $\tau > 0$ at asymptotically high SNR
the regularization term $\alpha^{\star\circ}$ in (67) converges to

$$\lim_{\rho\to\infty}\alpha^{\star\circ} = \frac{\tau^2}{1 - \tau^2}\,\frac{1}{\beta}. \tag{69}$$

Thus, for asymptotically high SNR, ORZF does not converge
to ZF precoding, in the sense that $\alpha^{\star\circ}$ does not converge to 0.
Also note that for $\beta \to \infty$, ORZF converges to ZF regardless
of $\tau$. The same has been observed in [31] for the MMSE
precoder.

With (67), the SINR (23) takes the following simplified
form.

¹ However, in simulations we observe only a single maximum for $\alpha > 0$. In
this case $\alpha^{\star\circ}$ can be computed very efficiently.



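Corollary 1: Let $\boldsymbol{\Theta} = \mathbf{I}_M$, $\mathbf{L} = \mathbf{I}_K$ and $\tau_k = \tau$ and $\gamma_{k,\mathrm{rzf}}$ be
the SINR of user $k$ under ORZF precoding. Then

$$\gamma_{k,\mathrm{rzf}} - \gamma^\circ_{k,\mathrm{rzf}} \xrightarrow{M\to\infty} 0, \tag{70}$$

almost surely, where $\gamma^\circ_{k,\mathrm{rzf}}$ is given by

$$\gamma^\circ_{k,\mathrm{rzf}} = \gamma^\circ_{\mathrm{rzf}} = m^\circ_{\hat{\mathbf{X}}^H\hat{\mathbf{X}}}(-\alpha^{\star\circ}) = \frac{\omega}{2}\rho(\beta - 1) + \frac{\chi}{2} - \frac{1}{2}, \tag{71}$$

where $\omega \in [0, 1]$ and $\chi$ are given by

$$\omega = \frac{1 - \tau^2}{1 + \tau^2\rho}, \tag{72}$$

$$\chi = \sqrt{(\beta - 1)^2\omega^2\rho^2 + 2(1 + \beta)\omega\rho + 1}. \tag{73}$$

Proof: Replace $\alpha$ in (23) by $\alpha^{\star\circ}$ in (67). After some
algebraic manipulations we obtain (71). ■
Note that for $\tau = 0$ and $\alpha = \alpha^{\star\circ}$, (71) is identical to
the asymptotic SINR derived in [18] and for the inter-cell
interference-free system in [19].
For $\beta = 1$, equation (71) simplifies to

$$\gamma^\circ_{k,\mathrm{rzf}} = -\frac{1}{2} + \sqrt{\omega\rho + \frac{1}{4}}. \tag{74}$$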



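Note that, as shown in the following subsection, $\omega\rho(\beta - 1)$
turns out to be a deterministic equivalent for the SINR of ZF
precoding for $\beta > 1$. Let $\Delta\gamma^{\circ}$ be the SINR gap between ORZF
and ZF precoding,

$$\Delta\gamma^{\circ} = \gamma^{\circ}_{\mathrm{rzf}} - \gamma^{\circ}_{\mathrm{zf}} = -\frac{\omega}{2}\rho(\beta-1) + \frac{\chi}{2} - \frac{1}{2}. \tag{75}$$

Then for asymptotically high SNR, $\Delta\gamma^{\circ}$ converges to the
following limits

$$\lim_{\rho\to\infty} \Delta\gamma^{\circ} = \begin{cases} \dfrac{1}{\beta-1} & \text{if } \tau^2 = 0, \\[2mm] -\dfrac{1-\tau^2}{2\tau^2}(\beta-1) + \dfrac{\bar\chi}{2} - \dfrac{1}{2} & \text{if } \tau^2 \neq 0, \end{cases} \tag{76}$$

where $\bar\chi$ is given by

$$\bar\chi = \sqrt{(\beta-1)^2\frac{(1-\tau^2)^2}{\tau^4} + 2(1+\beta)\frac{1-\tau^2}{\tau^2} + 1}. \tag{77}$$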



Note that the SINR gap $1/(\beta - 1)$ for $\tau^2 = 0$ is identical to
the ZF normalization scalar $\bar\Psi^{\circ}$ for large $(K, M)$. From
(76) we conclude that ZF and ORZF do not achieve the same
sum rate at asymptotically high SNR.
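
The closed forms (71)-(76) are easy to evaluate numerically. The following is a minimal sketch, not part of the original derivation; the function names `omega`, `sinr_orzf`, `sinr_zf` and `sinr_gap` are illustrative, and the setting is the one of Corollary 1 (no transmit correlation, equal path loss, common distortion $\tau^2$).

```python
import math


def omega(rho, tau2):
    """omega in (72); equals 1 for perfect CSIT (tau2 = 0)."""
    return (1.0 - tau2) / (1.0 + tau2 * rho)


def sinr_orzf(rho, beta, tau2):
    """Deterministic equivalent (71) of the ORZF SINR."""
    y = omega(rho, tau2) * rho
    chi = math.sqrt((beta - 1.0) ** 2 * y ** 2 + 2.0 * (1.0 + beta) * y + 1.0)
    return 0.5 * y * (beta - 1.0) + 0.5 * chi - 0.5


def sinr_zf(rho, beta, tau2):
    """ZF deterministic equivalent omega * rho * (beta - 1), valid for beta > 1."""
    return omega(rho, tau2) * rho * (beta - 1.0)


def sinr_gap(rho, beta, tau2):
    """SINR gap (75) between ORZF and ZF."""
    return sinr_orzf(rho, beta, tau2) - sinr_zf(rho, beta, tau2)


if __name__ == "__main__":
    # Illustrative parameters (assumed): beta = 2, tau2 = 0.1.
    for snr_db in (0, 10, 20, 30, 40):
        rho = 10.0 ** (snr_db / 10.0)
        print(snr_db, sinr_orzf(rho, 2.0, 0.1), sinr_gap(rho, 2.0, 0.1))
```

Evaluating `sinr_gap` at large `rho` reproduces the two cases of (76): the gap tends to $1/(\beta-1)$ for $\tau^2 = 0$ and to a constant depending on $\tau^2$ otherwise.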

C. Zero-forcing Precoding 

For $\alpha = 0$, the RZF precoding matrix in (21) reduces to the
ZF precoding matrix $G_{\mathrm{zf}}$, which reads

$$G_{\mathrm{zf}} = \xi\, H^H \left(H H^H\right)^{-1}, \tag{78}$$

where $\xi$ is a scaling factor to fulfill the power constraint (11).

To derive a deterministic equivalent of the SINR of ZF
precoding, we cannot apply the same techniques as for RZF,
since by removing a row of $H$, the matrix $H^H H$ becomes
singular. Therefore, we adopt a different strategy and derive
the SINR $\gamma_{k,\mathrm{zf}}$ for ZF of user $k$ and $\beta > 1$ as

$$\gamma_{k,\mathrm{zf}} = \lim_{\alpha\to 0} \gamma_{k,\mathrm{rzf}}. \tag{79}$$

The result is summarized in the following theorem.

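Theorem 3: Let $\beta > 1$ and $\gamma_{k,\mathrm{zf}}$ be the SINR of user $k$ for
ZF precoding. Then

$$\gamma_{k,\mathrm{zf}} - \gamma^{\circ}_{k,\mathrm{zf}} \xrightarrow{M\to\infty} 0, \tag{80}$$

almost surely, where $\gamma^{\circ}_{k,\mathrm{zf}}$ is given by

$$\gamma^{\circ}_{k,\mathrm{zf}} = \frac{l_k(1-\tau_k^2)}{l_k\tau_k^2\,\bar\Upsilon^{\circ} + \bar\Psi^{\circ}/\rho} \tag{81}$$

with

$$\bar\Psi^{\circ} = \frac{1}{\beta\bar c}\,\frac{1}{K}\mathrm{tr}\,L^{-1}, \tag{82}$$

$$\bar\Upsilon^{\circ} = \frac{c_2/\bar c^2}{\beta - c_2/\bar c^2}\,\frac{1}{K}\mathrm{tr}\,L^{-1}, \tag{83}$$

$$c_2 = \frac{1}{M}\mathrm{tr}\,\Theta^2\left(I_M + \frac{1}{\beta\bar c}\Theta\right)^{-2}, \tag{84}$$

where $\bar c$ is the unique solution of

$$\bar c = \frac{1}{M}\mathrm{tr}\,\Theta\left(I_M + \frac{1}{\beta\bar c}\Theta\right)^{-1}. \tag{85}$$

Moreover, $\bar c = \lim_{k\to\infty} \bar c_k$, where $\bar c_0 = 1$ and, for $k \geq 1$,

$$\bar c_k = \frac{1}{M}\mathrm{tr}\,\Theta\left(I_M + \frac{1}{\beta\bar c_{k-1}}\Theta\right)^{-1}. \tag{86}$$

Note that by Jensen's inequality $c_2/\bar c^2 \geq 1$, with equality if
$\Theta = I_M$.

Corollary 2: Let $\Theta = I_M$, $L = I_K$ and $\tau_k = \tau$; then $\gamma^{\circ}_{k,\mathrm{zf}}$
takes the explicit form

$$\gamma^{\circ}_{k,\mathrm{zf}} = \gamma^{\circ}_{\mathrm{zf}} = \frac{1-\tau^2}{\tau^2 + \frac{1}{\rho}}(\beta - 1). \tag{87}$$

Proof of Corollary 2: By substituting $\Theta = I_M$ and $L = I_K$
into (85), $\bar c$ is explicitly given by $\bar c = (\beta - 1)/\beta$. Since
$c_2/\bar c^2 = 1$ we have $\bar\Psi^{\circ} = \bar\Upsilon^{\circ} = 1/(\beta-1)$. ■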
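
The fixed-point recursion (86) and the resulting quantities (82)-(84) can be sketched as follows. This is an illustrative implementation under the assumption that $\Theta$ is represented only through its eigenvalues (the traces in (84)-(86) depend on $\Theta$ only through its spectrum) and $L$ through its diagonal gains; the function names and the fixed iteration count are choices made here, not part of the paper.

```python
def c_bar(eigs, beta, iters=500):
    """Iterate (86) from c_0 = 1; eigs are the eigenvalues of Theta."""
    c = 1.0
    for _ in range(iters):
        c = sum(lam / (1.0 + lam / (beta * c)) for lam in eigs) / len(eigs)
    return c


def c_two(eigs, beta, c):
    """c_2 in (84), evaluated at the fixed point c of (85)."""
    return sum((lam / (1.0 + lam / (beta * c))) ** 2 for lam in eigs) / len(eigs)


def zf_terms(eigs, gains, beta):
    """Return (Psi, Upsilon) from (82) and (83); gains are the diagonal of L."""
    c = c_bar(eigs, beta)
    ratio = c_two(eigs, beta, c) / c ** 2
    mean_inv_gain = sum(1.0 / g for g in gains) / len(gains)
    psi = mean_inv_gain / (beta * c)
    upsilon = ratio / (beta - ratio) * mean_inv_gain
    return psi, upsilon


def sinr_zf(l_k, tau2_k, rho, psi, upsilon):
    """Deterministic equivalent (81) of the ZF SINR of user k."""
    return l_k * (1.0 - tau2_k) / (l_k * tau2_k * upsilon + psi / rho)


if __name__ == "__main__":
    # Uncorrelated, equal path loss, beta = 2: both terms equal 1/(beta - 1) = 1.
    print(zf_terms([1.0] * 8, [1.0] * 4, 2.0))
```

For $\Theta = I_M$ and $L = I_K$ the sketch reproduces Corollary 2, i.e. $\bar c = (\beta-1)/\beta$ and $\bar\Psi^{\circ} = \bar\Upsilon^{\circ} = 1/(\beta-1)$.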

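Proof of Theorem 3: Recall the terms in the SINR of
RZF that depend on $\alpha$, i.e. $m_A$, $\Psi$ and $\Upsilon$,

$$m_A = \frac{1}{M}\mathrm{tr}\,\Theta F, \tag{88}$$

$$\Psi = \frac{1}{M}\mathrm{tr}\left(F - \alpha F^2\right), \tag{89}$$

$$\Upsilon = \frac{1}{M}\mathrm{tr}\,\Theta\left(F - \alpha F^2\right), \tag{90}$$

where

$$F = \left(H^H H + \alpha I_M\right)^{-1}. \tag{91}$$

In order to take the limit $\alpha \to 0$ of $\Psi$ and $\Upsilon$, we apply the
matrix inversion lemma (MIL) to $F$ in (91) and obtain

$$F - \alpha F^2 = H^H\left(H H^H + \alpha I_K\right)^{-2} H. \tag{92}$$

Since $HH^H$ is non-singular with probability one, we can
take the limit $\alpha \to 0$ of $\Psi$ and $\Upsilon$, for such $HH^H$,

$$\bar\Psi = \lim_{\alpha\to 0} \Psi = \frac{1}{M}\mathrm{tr}\left(HH^H\right)^{-1}, \tag{93}$$

$$\bar\Upsilon = \lim_{\alpha\to 0} \Upsilon = \frac{1}{M}\mathrm{tr}\,H\Theta H^H\left(HH^H\right)^{-2}. \tag{94}$$

Note that it is necessary to assume that $\beta > 1$ to ensure that the
maximum eigenvalue of the matrix $(HH^H)^{-1}$ is bounded for all
large $M$, almost surely. Since $m_A$ grows with $\alpha$ as $O(1/\alpha)$
we have

$$\gamma_{k,\mathrm{zf}} = \lim_{\alpha\to 0} \gamma_{k,\mathrm{rzf}} = \frac{l_k(1-\tau_k^2)}{l_k\tau_k^2\,\bar\Upsilon + \bar\Psi/\rho}. \tag{95}$$

Now we derive deterministic equivalents $\bar\Psi^{\circ}$ and $\bar\Upsilon^{\circ}$ for $\bar\Psi$
and $\bar\Upsilon$, respectively.

With $S_N = 0$, Theorem 1 can be directly applied to find $\bar\Psi^{\circ}$
s.t. $\bar\Psi - \bar\Psi^{\circ} \xrightarrow{M\to\infty} 0$, almost surely, as

$$\bar\Psi^{\circ} = \frac{1}{\beta\bar c}\,\frac{1}{K}\mathrm{tr}\,L^{-1}, \tag{96}$$

where $\bar c$ is defined in (85).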

In order to find $\bar\Upsilon^{\circ}$, notice that we can diagonalize $\Theta$ in (94)
s.t. $\Theta = U\,\mathrm{diag}(\lambda_1, \ldots, \lambda_M)\,U^H$, where $U$ is a unitary matrix,
and still have i.i.d. elements in the $k$th column $x'_k$ of $XU$.



Denoting C = HH^, C[k] 

and applying Lemma 4 twice, equation (94) takes the form 



A,LV2i;i;HLi/2 



1 



M 



(1 + Afex',HLi/2c-^[Li/2x; 



(97) 



Notice that $C_{[k]}^{-1}$ does not have uniformly bounded spectral
norm. Therefore, Lemma 5 cannot be applied straightforwardly.
However, since $\beta$ is uniformly greater than $1 + \varepsilon$ for
some $\varepsilon > 0$, $C_{[k]}^{-1}$ has almost surely bounded spectral norm
for all large $M$. This is sufficient for the following lemma to
hold.

Lemma 2: Let $A_1, A_2, \ldots$, with $A_N \in \mathbb{C}^{N\times N}$, be a series of
random matrices generated by the probability space $(\Omega, \mathcal{F}, P)$
such that, for $\omega \in A \subset \Omega$ with $P(A) = 1$, $\|A_N(\omega)\| <
K(\omega) < \infty$, uniformly on $N$. Let $x_1, x_2, \ldots$ be random vectors
of i.i.d. entries such that $x_N \in \mathbb{C}^N$ has entries of zero mean,
variance $1/N$ and finite eighth order moment, independent of
$A_N$. Then

$$x_N^H A_N x_N - \frac{1}{N}\mathrm{tr}\,A_N \xrightarrow{N\to\infty} 0,$$

almost surely.

The proof of Lemma 2 is left to Appendix V.
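
The statement of Lemma 2 can be illustrated with a small Monte-Carlo experiment. The sketch below is only an illustration under simplifying assumptions made here: real Gaussian entries instead of complex ones, and a random diagonal $A_N$ with entries in $[0, 1)$, so that its spectral norm is bounded by one. The function name `quadratic_form_gap` is hypothetical.

```python
import random


def quadratic_form_gap(n, seed=0):
    """Return |x^T A x - tr(A)/n| for a random diagonal A, independent of x."""
    rng = random.Random(seed)
    diag = [rng.random() for _ in range(n)]  # A = diag(a_1, ..., a_n), ||A|| <= 1
    x = [rng.gauss(0.0, 1.0) / n ** 0.5 for _ in range(n)]  # variance 1/n entries
    quad = sum(a * xi * xi for a, xi in zip(diag, x))
    trace = sum(diag) / n
    return abs(quad - trace)


if __name__ == "__main__":
    for n in (10, 100, 1000, 10000):
        print(n, quadratic_form_gap(n))
```

In this toy setting the gap has standard deviation of order $1/\sqrt{N}$, so it shrinks as the dimension grows.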

Moreover, the assumptions of the rank-1 perturbation
Lemma 7 are no longer satisfied. Thus, it cannot be ensured
that the normalized traces of $C_{[k]}^{-1}$ and $C^{-1}$ are asymptotically
equal. In fact, following the same line of argument as above,
we also have a generalized rank-1 perturbation lemma, which
now holds only almost surely.

Lemma 3: Let $A_1, A_2, \ldots$, with $A_N \in \mathbb{C}^{N\times N}$, be deterministic
with uniformly bounded spectral norm and $B_1, B_2, \ldots$, with
$B_N \in \mathbb{C}^{N\times N}$, be random Hermitian, with eigenvalues
$\lambda_1^{B_N} \leq \cdots \leq \lambda_N^{B_N}$, generated by the probability space
$(\Omega, \mathcal{F}, P)$, such that, for $\omega \in B \subset \Omega$ with $P(B) = 1$,
$\lambda_1^{B_N(\omega)} > \varepsilon(\omega)$, uniformly on $N$. Then for $v \in \mathbb{C}^N$,

$$\frac{1}{N}\mathrm{tr}\,A_N B_N^{-1} - \frac{1}{N}\mathrm{tr}\,A_N\left(B_N + vv^H\right)^{-1} \xrightarrow{N\to\infty} 0,$$

almost surely, where $B_N^{-1}$ and $(B_N + vv^H)^{-1}$ exist with
probability one.

The proof of Lemma 3 is left to Appendix VI.

Applying Lemma 2 together with the rank-1 perturbation 
Lemma 3, we obtain 

M 



T - ^TrLC" 



M 



M- 



-(1 + AfeiTrLC-i) 



0, (98) 



almost surely. To determine a deterministic equivalent 

for mc.L(O) = TrLC~'^, we apply Theorem 1 as for 
(96) (again, the definition of mc,L,{z) can be easily extended 
to ^ = 0). For TrLC-2 we have 



TrLC-2 = mc2,L(^) = m'c,L(0). 



(99) 



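The derivative of $m^{\circ}_{C,L}(0)$ is a deterministic equivalent of
$m'_{C,L}(0)$, so that, applied to (98), we have $\bar\Upsilon^{\circ}$ s.t.
$\bar\Upsilon - \bar\Upsilon^{\circ} \xrightarrow{M\to\infty} 0$, almost surely, which satisfies

$$\bar\Upsilon^{\circ} = \frac{c_2/\bar c^2}{\beta - c_2/\bar c^2}\,\frac{1}{K}\mathrm{tr}\,L^{-1}, \tag{100}$$

where $\bar c$ and $c_2$ are defined in (85) and (84), respectively.
Finally, we obtain (81) by substituting $\bar\Psi$ and $\bar\Upsilon$ in (95) by
their respective deterministic equivalents (96) and (100), which
completes the proof. ■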

V. Sum Rate Analysis Under Limited Feedback 

This section analyzes the behavior of the sum rate under
a limited-rate feedback link. To obtain tractable expressions,
we restrict the subsequent analysis to homogeneous networks
($L = I_K$) without transmit correlation ($\Theta = I_M$). As previously
stated for $L = I_K$, we assume $\tau_k = \tau$. In this case and for
large $(K, M)$ all users have equal SINR and thus optimizing
the SINR is equivalent to optimizing the per-user rate and the
sum rate.

The considered performance metric is the sum rate $R_{\mathrm{sum}}$
defined in (14). Define $R^{\circ}_{\mathrm{sum}}$ to be

$$R^{\circ}_{\mathrm{sum}} = \sum_{k=1}^{K} \log_2\left(1 + \gamma^{\circ}_k\right) \quad \text{[bits/s/Hz]}, \tag{101}$$

where $\gamma^{\circ}_k$ equals $\gamma^{\circ}_{k,\mathrm{rzf}}$ or $\gamma^{\circ}_{k,\mathrm{zf}}$ for RZF or ZF precoding,
respectively. From $\gamma_k - \gamma^{\circ}_k \to 0$ almost surely, we have
that $\frac{1}{K}(R_{\mathrm{sum}} - R^{\circ}_{\mathrm{sum}}) \to 0$, almost surely. From Remark 1
we infer that, if $X$ is Gaussian, we have $R_{\mathrm{sum}} - R^{\circ}_{\mathrm{sum}} \to 0$,
almost surely.

Subsequently, we will derive the scaling of the distortion 
necessary to approximately maintain a per-user rate gap 
between perfect CSIT and imperfect CSIT. 



A. Optimal Regularized Zero-forcing Precoding 

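Consider an SNR-independent distortion $\tau^2$. As was observed
in [22], [23] for ZF precoding and in [31] for MMSE
precoding, the sum rate of ORZF precoding saturates at
asymptotically high SNR.

Corollary 3: Under the conditions of Corollary 1, the approximate
sum rate $R^{\circ}_{\mathrm{sum}}$ saturates for asymptotically high SNR
at

$$R^{\circ}_{\mathrm{sum,lim}} = \lim_{\rho\to\infty} K\log_2\left(1 + \gamma^{\circ}_{\mathrm{rzf}}\right) = K\log_2\left(1 + \gamma^{\circ}_{\mathrm{rzf,lim}}\right), \tag{102}$$

where $\gamma^{\circ}_{\mathrm{rzf,lim}}$ is given by

$$\gamma^{\circ}_{\mathrm{rzf,lim}} = \begin{cases} -\dfrac{1}{2} + \sqrt{\dfrac{1-\tau^2}{\tau^2} + \dfrac{1}{4}} & \text{if } \beta = 1, \\[2mm] \dfrac{1-\tau^2}{2\tau^2}(\beta-1) + \dfrac{\bar\chi}{2} - \dfrac{1}{2} & \text{if } \beta > 1, \end{cases} \tag{103}$$

where $\bar\chi$ is given by (77).

Proof: (102) follows directly from Corollary 1 by taking
the limit $\rho \to \infty$. ■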
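The rate gap per user under ORZF is given in the following
theorem.

Theorem 4: Let $\Theta = I_M$, $L = I_K$ and $\tau^2$ be the distortion
per user. Define $\Delta R^{\circ}_{\mathrm{rzf}}$ to be the difference between the per-user
rate of ORZF precoding under perfect CSIT and imperfect
CSIT. Then $\Delta R^{\circ}_{\mathrm{rzf}}$ is given by

$$\Delta R^{\circ}_{\mathrm{rzf}} = \log_2\left(\frac{1 + g(1, \beta)}{1 + g(\omega, \beta)}\right) \quad \text{[bits/s/Hz]}, \tag{104}$$

where $\omega$ is given in (72) and

$$g(x, \beta) = x\rho(\beta - 1) + \sqrt{x^2\rho^2(\beta-1)^2 + 2(1 + \beta)x\rho + 1}.$$

Proof: With Corollary 1, simply compute $\Delta R^{\circ}_{\mathrm{rzf}} =
\log_2(1 + \gamma^{\circ}_{\mathrm{rzf}}|_{\tau^2 = 0}) - \log_2(1 + \gamma^{\circ}_{\mathrm{rzf}})$. ■

Corollary 4: Under the conditions of Theorem 4 and with $\beta = 1$,
the per-user rate loss $\Delta R^{\circ}_{\mathrm{rzf}}$ takes the form

$$\Delta R^{\circ}_{\mathrm{rzf}} = \log_2\left(\frac{1 + \sqrt{1 + 4\rho}}{1 + \sqrt{1 + 4\omega\rho}}\right) \quad \text{[bits/s/Hz]}. \tag{105}$$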



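Following the work in [22], we extend [22, Theorem 3] to
ORZF precoding in the following theorem.

Theorem 5: Let $\Theta = I_M$ and $L = I_K$. To maintain a rate
offset no larger than $\log_2 b$ (per user) between ORZF with
perfect CSIT and imperfect CSIT, the distortion $\tau^2$ has to
scale approximately with

$$\tau^2 \approx \phi^{\star\circ}_{\mathrm{rzf}}(\rho, b)\,\rho^{-1}, \tag{106}$$

$$\phi^{\star\circ}_{\mathrm{rzf}}(\rho, b) = \frac{\rho\left[(1+\beta)b + w(\beta-1)\right] - \frac{1}{2b}\left(w^2 - b^2\right)}{(1+\beta)b + w(\beta-1) + \frac{1}{2b}\left(w^2 - b^2\right)}, \tag{107}$$

$$w(\rho, b) = 1 - b + g(1, \beta). \tag{108}$$

Proof: Set $\Delta R^{\circ}_{\mathrm{rzf}} = \log_2 b$ and solve for $\tau^2$. ■

Corollary 5: Under the conditions of Theorem 5 with $\beta = 1$,
the distortion $\tau^2$ has to scale approximately with

$$\tau^2 \approx \frac{1 + 4\rho - \frac{w^2}{b^2}}{3 + \frac{w^2}{b^2}}\,\frac{1}{\rho}. \tag{109}$$

If the SNR grows to infinity, the term $\phi^{\star\circ}_{\mathrm{rzf}}(\rho, b)$ in (107)
converges to the following limits,

$$\lim_{\rho\to\infty} \phi^{\star\circ}_{\mathrm{rzf}}(\rho, b) = \begin{cases} b^2 - 1 & \text{if } \beta = 1, \\ b - 1 & \text{if } \beta > 1. \end{cases} \tag{110}$$

Details can be found in Appendix VII-A.

For a direct comparison of Theorem 5 to [22, Theorem 3],
we set $\tau^2 = 2^{-B^{\star\circ}_{\mathrm{rzf}}/(M-1)}$, where $B^{\star\circ}_{\mathrm{rzf}}$ is the number of feedback
bits per user under ORZF precoding. Thus, (106) takes the
form

$$B^{\star\circ}_{\mathrm{rzf}} = (M - 1)\log_2\rho - (M - 1)\log_2\phi^{\star\circ}_{\mathrm{rzf}}(\rho, b). \tag{111}$$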
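
As a numerical cross-check of Theorems 4 and 5, the sketch below evaluates the rate gap (104), the scaling function (107) and the feedback rule (111). It is an illustration only; the function names are chosen here, and the formulas are those of the homogeneous setting $\Theta = I_M$, $L = I_K$.

```python
import math


def g(x, beta, rho):
    """g(x, beta) of Theorem 4 (rho is passed explicitly)."""
    return x * rho * (beta - 1.0) + math.sqrt(
        x * x * rho * rho * (beta - 1.0) ** 2 + 2.0 * (1.0 + beta) * x * rho + 1.0
    )


def rate_gap_orzf(rho, beta, tau2):
    """Per-user rate gap (104) between perfect and imperfect CSIT."""
    omega = (1.0 - tau2) / (1.0 + tau2 * rho)
    return math.log2((1.0 + g(1.0, beta, rho)) / (1.0 + g(omega, beta, rho)))


def phi_orzf(rho, beta, b):
    """Scaling function (107); tau2 is approximately phi / rho."""
    w = 1.0 - b + g(1.0, beta, rho)
    s = (1.0 + beta) * b + w * (beta - 1.0)
    q = (w * w - b * b) / (2.0 * b)
    return (rho * s - q) / (s + q)


def feedback_bits_orzf(m, rho, beta, b):
    """Feedback bits per user from (111), assuming tau2 = 2^(-B/(M-1))."""
    return (m - 1) * math.log2(rho) - (m - 1) * math.log2(phi_orzf(rho, beta, b))


if __name__ == "__main__":
    rho, beta, b = 100.0, 2.0, 2.0
    tau2 = phi_orzf(rho, beta, b) / rho
    print(tau2, rate_gap_orzf(rho, beta, tau2), feedback_bits_orzf(32, rho, beta, b))
```

Plugging $\tau^2 = \phi^{\star\circ}_{\mathrm{rzf}}/\rho$ back into (104) returns a gap of $\log_2 b$, and for large $\rho$ the function `phi_orzf` approaches the limits in (110).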
B. Regularized Zero-forcing Precoding with $\alpha = 1/(\beta\rho)$

Although the RZF-CDU precoder is suboptimal under imperfect
CSIT, the results are useful to compare to the work
in [22]. [22, Theorem 3] gives the minimum number of
feedback bits necessary to maintain a rate offset of $\log_2 b$
per user for ZF precoding under random vector quantization
(RVQ). Moreover, the author claims that [22, Theorem 3]
also holds true for RZF-CDU precoding, i.e. the optimal RZF
precoder under the assumption of perfect CSIT.

For the sake of comparison to [22, Theorem 3], we state
the following proposition.

Proposition 3: Under the conditions of Theorem 5, under RZF-CDU
precoding and $\tau^2 = 2^{-B^{\circ}_{\mathrm{rzf}}/(M-1)}$, the number of feedback bits
$B^{\circ}_{\mathrm{rzf}}$ has to scale approximately with

$$B^{\circ}_{\mathrm{rzf}} = (M - 1)\log_2\rho - (M - 1)\log_2\phi^{\circ}_{\mathrm{rzf}}(\rho, b), \tag{112}$$

where for asymptotically high SNR, $\phi^{\circ}_{\mathrm{rzf}}(\rho, b)$ converges to
the following limits,

$$\lim_{\rho\to\infty} \phi^{\circ}_{\mathrm{rzf}}(\rho, b) = \begin{cases} 2(b - 1) & \text{if } \beta = 1, \\ b - 1 & \text{if } \beta > 1. \end{cases} \tag{113}$$

The proof is provided in Appendix VII-B.



C. Zero-forcing Precoding 

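Corollary 6: Under the conditions of Corollary 2, the approximate
sum rate $R^{\circ}_{\mathrm{sum}}$ saturates for asymptotically high SNR
at

$$R^{\circ}_{\mathrm{sum,lim}} = \lim_{\rho\to\infty} \sum_{k=1}^{K} \log_2\left(1 + \gamma^{\circ}_{k,\mathrm{zf}}\right) \tag{114}$$

$$= K\log_2\left(1 + \frac{1-\tau^2}{\tau^2}(\beta - 1)\right). \tag{115}$$

Proof: (115) follows directly from Corollary 2 by taking
the limit $\rho \to \infty$. ■

Theorem 6: Let $\beta > 1$, $\Theta = I_M$, $L = I_K$ and $\tau^2$ be the
distortion per user. Define $\Delta R^{\circ}_{\mathrm{zf}}$ to be the difference between
the per-user rate of ZF precoding under perfect CSIT and
imperfect CSIT. Then $\Delta R^{\circ}_{\mathrm{zf}}$ is given by

$$\Delta R^{\circ}_{\mathrm{zf}} = \log_2\left(\frac{1 + \rho(\beta - 1)}{1 + \frac{(1-\tau^2)(\beta-1)}{\tau^2 + \frac{1}{\rho}}}\right) \quad \text{[bits/s/Hz]}. \tag{116}$$

Theorem 7: Let $\beta > 1$, $\Theta = I_M$ and $L = I_K$. To maintain
a rate offset no larger than $\log_2 b$ (per user) between ZF with
perfect CSIT and imperfect CSIT, the distortion $\tau^2$ has to scale
approximately with

$$\tau^2 \approx \phi^{\circ}_{\mathrm{zf}}(\rho, b)\,\rho^{-1}, \tag{117}$$

$$\phi^{\circ}_{\mathrm{zf}}(\rho, b) = \frac{(b-1)\left[1 + \rho(\beta-1)\right]}{\rho(\beta-1) + b(\beta-2) + 1}. \tag{118}$$

For asymptotically high SNR, $\phi^{\circ}_{\mathrm{zf}}(\rho, b)$ in (118) converges to

$$\lim_{\rho\to\infty} \phi^{\circ}_{\mathrm{zf}}(\rho, b) = b - 1. \tag{119}$$

Under RVQ we have

$$B^{\circ}_{\mathrm{zf}} = (M - 1)\log_2\rho - (M - 1)\log_2\phi^{\circ}_{\mathrm{zf}}(\rho, b). \tag{120}$$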
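
The ZF counterpart of the feedback scaling can be checked in the same way. The sketch below is illustrative only: it evaluates (116), a scaling function obtained by solving $\Delta R^{\circ}_{\mathrm{zf}} = \log_2 b$ for $\tau^2$ (written here in the form of (118)), and the RVQ rule (120). The function names are assumptions made for this example.

```python
import math


def rate_gap_zf(rho, beta, tau2):
    """Per-user rate gap (116) of ZF between perfect and imperfect CSIT."""
    imperfect = 1.0 + (1.0 - tau2) * (beta - 1.0) / (tau2 + 1.0 / rho)
    return math.log2((1.0 + rho * (beta - 1.0)) / imperfect)


def phi_zf(rho, beta, b):
    """Scaling function (118); tau2 is approximately phi / rho."""
    num = (b - 1.0) * (1.0 + rho * (beta - 1.0))
    den = rho * (beta - 1.0) + b * (beta - 2.0) + 1.0
    return num / den


def feedback_bits_zf(m, rho, beta, b):
    """Feedback bits per user from (120), assuming tau2 = 2^(-B/(M-1))."""
    return (m - 1) * math.log2(rho) - (m - 1) * math.log2(phi_zf(rho, beta, b))


if __name__ == "__main__":
    rho, beta, b = 100.0, 2.0, 2.0
    tau2 = phi_zf(rho, beta, b) / rho
    print(tau2, rate_gap_zf(rho, beta, tau2), feedback_bits_zf(32, rho, beta, b))
```

With these example values the distortion is $\tau^2 = 0.01$ and the resulting rate gap equals $\log_2 b = 1$ bit/s/Hz.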



D. Discussion 

At this point we can draw the following conclusions. For
$\beta = 1$ the optimal scaling of the feedback bits $B^{\star\circ}_{\mathrm{rzf}}$, $B^{\circ}_{\mathrm{rzf}}$ and
$B$ for ZF in [22, Theorem 3] are different, even in the high
SNR limit. In fact, for RZF-CDU, the upper bound in [22,
Theorem 3] is too pessimistic in the scaling of the feedback
bits. From (112) and (113), a more accurate choice is

$$B^{\circ}_{\mathrm{rzf}} = (M - 1)\log_2\rho - (M - 1)\log_2(2(b - 1)), \tag{121}$$

i.e. $M - 1$ bits less than proposed in [22, Theorem 3]. However,
notice that (121) is an approximation and not a bound, i.e.
a rate gap of $\log_2 b$ bits/s/Hz cannot be guaranteed (as [22,
Theorem 3] does) for a finite number of users, but is exactly
maintained as $K, M \to \infty$. Nevertheless, simulations show
that the approximation is accurate even for finite $(K, M)$.

Moreover, for high SNR, to maintain a rate offset of $\log_2 b$,
ORZF requires $(M - 1)\log_2(b + 1)$ bits less than ZF precoding
and $(M - 1)\log_2\left(\frac{b+1}{2}\right)$ bits less than RZF-CDU. Note that the
result (110) does not converge to the result for ZF precoding,
i.e. $(b - 1)$ [22, Theorem 3], since the optimal regularization
parameter for ORZF, designed for imperfect CSIT, does not
converge to zero, and therefore the ORZF precoder does not
match the ZF precoder for asymptotically high SNR.

In contrast, for $\beta > 1$, we have $B^{\star\circ}_{\mathrm{rzf}} = B^{\circ}_{\mathrm{rzf}} = B^{\circ}_{\mathrm{zf}}$ at
asymptotically high SNR. Intuitively, the reason is that for $\beta >
1$ the channel is well conditioned and RZF and ZF perform
very similarly. Therefore, both schemes are equally sensitive to
imperfect CSIT and thus the scaling of $\tau^2$ is the same for high
SNR.

Notice that our model comprises a generic distortion of the 
CSIT. That is, the distortion can be a combination of different 
additional factors, e.g. channel estimation at the receivers, 
channel mismatch due to feedback delay or feedback errors. 
Moreover, we consider i.i.d. block-fading channels, which can 
be seen as a worst case scenario in terms of feedback overhead. 
It is possible to exploit channel correlation in time, frequency 
and space to refine the CSIT or to reduce the amount of 
feedback. 

VI. Applications 

This section presents two applications of the theoretical 
results derived in Section IV. 

In what follows, we use the approximated sum rate (101) 
to compute for ZF, (i) the optimal number of active users 
selected for transmission and, for ZF and ORZF (ii) the 
amount of channel training in TDD multi-user systems re- 
quired to maximize the sum rate. The derived solutions are 
close approximations to the exact solutions to (i) and (ii) 
and their accuracy increases as {K, M) grow large. Despite 
their approximate character, the solutions give valuable insight
into the system behavior and can also be utilized as good
initialization points for further optimization if $(K, M)$ are
small.

A. ZF: Optimal Number of Active Users 

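For fixed $\Theta$, $L$, $\rho$ and $\tau_k^2$, we consider the problem of
finding the number of users $K^{\star\circ}$ (or equally the system load
$\beta^{\star\circ} = M/K^{\star\circ}$), such that the approximated sum rate (101) is
maximized, i.e.

$$\beta^{\star\circ} = \arg\max_{\beta > 1} \frac{1}{\beta}\int \log_2\left(1 + \gamma^{\circ}_{k,\mathrm{zf}}\right) dF_L(l), \tag{122}$$

where we suppose that the user channel gains $l_k$ are distributed
according to some probability distribution function $F_L$. By
setting the derivative of (122) w.r.t. $\beta$ to zero, we obtain the
implicit equation

$$\beta\int \frac{\partial\gamma^{\circ}_{k,\mathrm{zf}}/\partial\beta}{\left(1 + \gamma^{\circ}_{k,\mathrm{zf}}\right)\ln 2}\,dF_L(l) = \int \log_2\left(1 + \gamma^{\circ}_{k,\mathrm{zf}}\right) dF_L(l), \tag{123}$$

and $\beta^{\star\circ}$ is a solution to (123).

Proposition 4: Let $\Theta = I_M$ and $L = I_K$; then $\beta^{\star\circ}$ is given
by

$$\beta^{\star\circ} = \left(1 - \frac{1}{a}\right)\left(1 + \frac{1}{W(x)}\right), \tag{124}$$

where $W(x)$ is the single-valued Lambert-W function, defined
as the unique solution to $W(x) = x e^{-W(x)}$, and

$$a = \frac{1 - \tau^2}{\tau^2 + \frac{1}{\rho}}, \qquad x = \frac{a - 1}{e}. \tag{125}$$

The proof of Proposition 4 is given in Appendix VIII. Note
that $\lim_{a\to 1} \beta^{\star\circ} = e$. Moreover, only rational values of $\beta$ are
meaningful in practice.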
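
Proposition 4 can be evaluated without a special-function library. The following sketch is an illustration, not the authors' implementation: it computes the principal branch of the Lambert-W function by Newton's method and then the load $\beta^{\star\circ}$ from (124)-(125). It is restricted, as an assumption of this example, to $a > 1$ so that $x > 0$; the names `lambert_w` and `optimal_load` are hypothetical.

```python
import math


def lambert_w(x, iters=50):
    """Principal branch W(x) for x >= 0, via Newton's method on w*exp(w) = x."""
    w = math.log1p(x)  # starting point to the right of the root
    for _ in range(iters):
        ew = math.exp(w)
        w -= (w * ew - x) / (ew * (w + 1.0))
    return w


def optimal_load(rho, tau2):
    """System load beta from (124)-(125) for Theta = I_M, L = I_K."""
    a = (1.0 - tau2) / (tau2 + 1.0 / rho)
    if a <= 1.0:
        raise ValueError("this sketch only covers a > 1")
    x = (a - 1.0) / math.e
    return (1.0 - 1.0 / a) * (1.0 + 1.0 / lambert_w(x))


if __name__ == "__main__":
    m = 32  # number of transmit antennas (illustrative)
    for snr_db in (5, 10, 20, 30):
        beta = optimal_load(10.0 ** (snr_db / 10.0), 0.1)
        print(snr_db, beta, m / beta)  # m / beta approximates K
```

In practice the resulting $M/\beta^{\star\circ}$ would be rounded to an integer number of users, in line with the remark that only rational values of $\beta$ are meaningful.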

B. Optimal Amount of Channel Training in TDD Multi-user 

Systems 

Consider a time division duplex (TDD) system where uplink
and downlink access the same channel at different times.
Therefore, the transmitter estimates the channel from known
pilot signaling of the receivers. The imperfections in the CSIT
are caused by (i) channel estimation errors, (ii) imperfect
channel reciprocity due to different hardware in the transmitter
and receiver and (iii) the duration for which the channel is
approximately constant, i.e. the channel coherence time.

In what follows we assume that the channel is perfectly
reciprocal and study the impact of (i) and (iii) for $\Theta = I_M$ and
$L = I_K$. The channel is assumed to be constant over $T \in \mathbb{N}^+$
channel uses, which are divided into $T_d$ channel uses for data
transmission in the downlink and $T_t$ channel uses for channel
training in the uplink. During the training phase, each user
sends $T_t$ orthogonal pilot symbols (which limits the number
of users to $K \leq T_t$), from which the transmitter estimates
the channel using an MMSE channel estimator. Furthermore, we
assume that the SNR $\rho_{ul}$ of the uplink channel is smaller
than the SNR $\rho_{dl}$ of the downlink channel, since the receivers
usually have less transmit power available. This setup has been
considered in [35]-[37].

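Since the sum rate itself depends on $T_d$ and $T_t$,
there exists a non-trivial trade-off in allocating the channel
resources between channel training and data transmission. The
approximated normalized sum rate $\bar R^{\circ}_{\mathrm{sum,zf}} = \left(1 - \frac{T_t}{T}\right) R^{\circ}_{\mathrm{sum,zf}}$ takes the
form

$$\bar R^{\circ}_{\mathrm{sum,zf}} = K\left(1 - \frac{T_t}{T}\right)\log_2\left(1 + \frac{1 - \tau^2}{\tau^2 + \frac{1}{\rho_{dl}}}(\beta - 1)\right). \tag{126}$$

Similarly, for ORZF we have

$$\bar R^{\circ}_{\mathrm{sum,rzf}} = K\left(1 - \frac{T_t}{T}\right)\log_2\left(1 + \gamma^{\circ}_{\mathrm{rzf}}\right), \tag{127}$$

where $\gamma^{\circ}_{\mathrm{rzf}}$ is given in Corollary 1. The distortion $\tau^2$ in the
CSIT is solely caused by an imperfect channel estimation in
the uplink and is identical for all entries of $H$. To acquire
CSIT, the users transmit orthogonal pilot symbols over the
uplink channel to the transmitter, which receives the vector $r_k$
from user $k$,

$$r_k = \sqrt{T_t M \rho_{ul}}\,h_k + n_k, \tag{128}$$

where $n_k \sim \mathcal{CN}(0, I_M)$ is the noise vector. Subsequently
the transmitter performs an MMSE estimation of each channel
coefficient. Due to the orthogonality property of the MMSE
estimation, the estimate $\hat h_{ij}$ of $h_{ij}$ ($i = 1, \ldots, K$, $j = 1, \ldots, M$)
is independent of the estimation error $\tilde h_{ij} \sim \mathcal{CN}(0, \sigma_t^2/M)$ and
we have [38]

$$h_{ij} = \hat h_{ij} + \tilde h_{ij}, \tag{129}$$

where the variance $\sigma_t^2/M = \mathrm{E}[\tilde h_{ij}\tilde h_{ij}^{*}]$ of the estimation error
$\tilde h_{ij}$ is given through

$$\sigma_t^2 = \frac{1}{1 + T_t\rho_{ul}}. \tag{130}$$

Substituting $\tau^2 = \sigma_t^2$ into (126) and (127) we obtain

$$\bar R^{\circ}_{\mathrm{sum,zf}} = K\left(1 - \frac{T_{t,\mathrm{zf}}}{T}\right)\log_2\left(1 + \frac{(\beta - 1)\,T_{t,\mathrm{zf}}\,\rho_{ul}\rho_{dl}}{1 + T_{t,\mathrm{zf}}\,\rho_{ul} + \rho_{dl}}\right), \tag{131}$$

$$\bar R^{\circ}_{\mathrm{sum,rzf}} = K\left(1 - \frac{T_{t,\mathrm{rzf}}}{T}\right)\log_2\left(\frac{1}{2} + \frac{w}{2}\rho_{dl}(\beta - 1) + \frac{d}{2}\right), \tag{132}$$

$$d = \sqrt{(1 - \beta)^2 w^2\rho_{dl}^2 + 2w\rho_{dl}(1 + \beta) + 1}, \tag{133}$$

$$w = \frac{T_{t,\mathrm{rzf}}\,\rho_{ul}}{1 + T_{t,\mathrm{rzf}}\,\rho_{ul} + \rho_{dl}}. \tag{134}$$

For $\beta > 1$ under ZF precoding and $\beta \geq 1$ for RZF precoding, it
is easy to verify that the functions $\bar R^{\circ}_{\mathrm{sum,zf}}$ and $\bar R^{\circ}_{\mathrm{sum,rzf}}$ are strictly
concave in $T_{t,\mathrm{zf}}$ and $T_{t,\mathrm{rzf}}$ in the interval $K \leq T_{t,\mathrm{zf}}, T_{t,\mathrm{rzf}} \leq
T$, where $K$ is the minimum amount of training due to the
orthogonality of the pilot sequences. Therefore, we can apply
standard convex optimization algorithms [39] to evaluate

$$T^{\star\circ}_{t,\mathrm{zf}} = \arg\max_{K \leq T_{t,\mathrm{zf}} \leq T} \bar R^{\circ}_{\mathrm{sum,zf}}, \tag{135}$$

$$T^{\star\circ}_{t,\mathrm{rzf}} = \arg\max_{K \leq T_{t,\mathrm{rzf}} \leq T} \bar R^{\circ}_{\mathrm{sum,rzf}}. \tag{136}$$

The uplink SNR $\rho_{ul}$ and downlink SNR $\rho_{dl}$ are dependent
and we consider that $\rho_{ul}/\rho_{dl}$ is constant as both $\rho_{ul}$ and $\rho_{dl}$
grow large. For the limiting cases $\rho_{dl} \to 0$ and $\rho_{dl} \to \infty$, we
obtain the following solutions to (135) and (136). Details can
be found in Appendix IX.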

At asymptotically low SNR, (135) and (136) have the
solution (cf. Appendix IX-A)

$$T^{\star\circ}_{t,\mathrm{zf}} = \lim_{\rho_{dl}\to 0}\ \arg\max_{K \leq T_{t,\mathrm{zf}} \leq T} \bar R^{\circ}_{\mathrm{sum,zf}} = \frac{T}{2}, \tag{137}$$

$$T^{\star\circ}_{t,\mathrm{rzf}} = \lim_{\rho_{dl}\to 0}\ \arg\max_{K \leq T_{t,\mathrm{rzf}} \leq T} \bar R^{\circ}_{\mathrm{sum,rzf}} = \frac{T}{2}. \tag{138}$$

Fig. 1. ORZF, sum rate vs. SNR with $M = 32$, $\beta = 1$ and $\alpha^{\star\circ}$; simulation
results are indicated by circle marks, with error bars indicating one standard
deviation in each direction.

Fig. 2. ZF, sum rate vs. SNR with $M = 32$, $\beta = 2$; simulation results are
indicated by circle marks, with error bars indicating one standard deviation in
each direction.



That is, the total channel resources T are equally divided into 
channel training and data transmission. 

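For asymptotically high SNR the optimal amounts of training
$T^{\star\circ}_{t,\mathrm{zf}}$ and $T^{\star\circ}_{t,\mathrm{rzf}}$ for ZF precoding and RZF precoding,
respectively, are given by (cf. Appendix IX-B)

$$T^{\star\circ}_{t,\mathrm{zf}} = \lim_{\rho_{dl}\to\infty}\ \arg\max_{K \leq T_{t,\mathrm{zf}} \leq T} \bar R^{\circ}_{\mathrm{sum,zf}} = K, \tag{139}$$

$$T^{\star\circ}_{t,\mathrm{rzf}} = \lim_{\rho_{dl}\to\infty}\ \arg\max_{K \leq T_{t,\mathrm{rzf}} \leq T} \bar R^{\circ}_{\mathrm{sum,rzf}} = K. \tag{140}$$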

That is, only the minimal amount of training to obtain a 
channel estimate is required. 
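
The two limiting regimes can be reproduced with a brute-force search over the integer training lengths. This is a minimal sketch under assumptions made here: it uses the ZF rate with $\tau^2 = 1/(1 + T_t\rho_{ul})$, a plain exhaustive search instead of a convex solver, and illustrative parameter values ($T = 100$, $K = 16$, $\beta = 2$, $\rho_{ul} = 0.1\rho_{dl}$).

```python
import math


def zf_training_rate(t_train, t_total, k, beta, rho_ul, rho_dl):
    """Normalized ZF sum rate for t_train pilot symbols out of t_total uses."""
    tau2 = 1.0 / (1.0 + t_train * rho_ul)  # MMSE estimation error variance
    sinr = (1.0 - tau2) / (tau2 + 1.0 / rho_dl) * (beta - 1.0)
    # log1p keeps precision when the SINR is very small (low SNR regime)
    return k * (1.0 - t_train / t_total) * math.log1p(sinr) / math.log(2.0)


def best_training_length(t_total, k, beta, rho_ul, rho_dl):
    """Exhaustive search over K <= T_t < T (T_t = T leaves no data symbols)."""
    return max(
        range(k, t_total),
        key=lambda t: zf_training_rate(t, t_total, k, beta, rho_ul, rho_dl),
    )


if __name__ == "__main__":
    for snr_db in (-40, 0, 20, 60):
        rho_dl = 10.0 ** (snr_db / 10.0)
        print(snr_db, best_training_length(100, 16, 2.0, 0.1 * rho_dl, rho_dl))
```

At very low SNR the search returns a value close to $T/2$, and at very high SNR it returns the minimum $K$, in line with (137) and (139).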

VII. Numerical Results 

In this section we compare our theoretical results to Monte- 
Carlo simulations and assume that the entries of X, Q are i.i.d. 
Gaussian distributed. 

We begin by introducing the simulation assumptions. There- 
after in Section VII-B, we compare the approximate results 
of Section IV to Monte-Carlo simulations. The outcome of 
these experiments clearly reveals the accuracy of the proposed 
approximations. Afterwards we present simulation results of 
the ORZF and the two applications discussed in Section VI. 

A. Simulation Assumptions 

The performance of the Monte-Carlo simulations is aver- 
aged over 10,000 independent Rayleigh block-fading channels. 

1) Correlation Model: The transmit correlation is assumed
to originate from a dense antenna packing at the transmitter.
Under a rich scattering environment we use the classical Jakes'
model, where the correlation between antennas $i$ and $j$
depends on their distance $d_{ij}$, $i, j = 1, 2, \ldots, M$. Thus, we
have [40]

$$\Theta_{ij} = J_0\left(\frac{2\pi d_{ij}}{\lambda}\right), \tag{141}$$

where $J_0$ is the zero-order Bessel function of the first kind and
$\lambda$ is the signal wavelength. In particular we consider a uniform
circular array (UCA) of radius $r$ [41]. To ensure that $\lambda_M$ grows
slower than $O(M)$, we suppose that the distance between
adjacent antennas $d = d_{i,i+1}$ is independent of $M$. Thus, the
radius $r$ of the UCA scales with $M$ as $r = d/(2\sin(\pi/M))$.
The distance $d_{ij}$ between antennas $i$ and $j$ is given by

$$d_{ij} = 2r\sin\left(\pi\frac{|i - j|}{M}\right). \tag{142}$$

In that case $\Theta$ is a circulant matrix.

2) Path Loss Model: In order to model the channel gains
$l_k$ we consider a circular cell of radius $r_c = 500$ meters and
assume that the $K$ users are uniformly distributed over the cell
area [42]. The path loss is computed according to the "Suburban
Macro" scenario, which is based on the modified COST231
Hata urban propagation model and defined in [43], as

$$l_k = -(31.5 + 35\log_{10} d_k) \quad \text{[dB]}, \tag{143}$$

where $d_k$ is the distance of user $k$ to the transmitter. We
normalize the $l_k$ s.t. $\mathrm{E}[l_k] = 1$ to ensure that the average receive
power is identical for all users. The notation $L \neq I_K$ indicates
that the $l_k$ are distributed according to (143).
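
The correlation model (141)-(142) can be generated with the standard library alone. The sketch below is illustrative: the Bessel function is evaluated through its integral representation $J_0(x) = \frac{1}{\pi}\int_0^{\pi}\cos(x\sin\theta)\,d\theta$ with a trapezoidal rule, distances are expressed in units of the wavelength, and the function names and the example spacing of half a wavelength are assumptions of this example.

```python
import math


def bessel_j0(x, n=200):
    """J_0(x) from (1/pi) * integral_0^pi cos(x sin(theta)) d(theta), trapezoid rule."""
    h = math.pi / n
    total = 1.0  # both endpoints contribute cos(0) = 1 with weight 1/2
    total += sum(math.cos(x * math.sin(k * h)) for k in range(1, n))
    return total * h / math.pi


def uca_correlation(m, spacing):
    """Jakes correlation matrix (141) for a UCA; spacing = d / lambda."""
    radius = spacing / (2.0 * math.sin(math.pi / m))
    theta = []
    for i in range(m):
        row = []
        for j in range(m):
            d_ij = 2.0 * radius * math.sin(math.pi * abs(i - j) / m)  # (142)
            row.append(bessel_j0(2.0 * math.pi * d_ij))
        theta.append(row)
    return theta


if __name__ == "__main__":
    theta = uca_correlation(8, 0.5)
    print([round(v, 4) for v in theta[0]])  # first row defines the circulant matrix
```

Since $d_{ij}$ depends only on $|i - j|$ modulo $M$, every row of the resulting matrix is a cyclic shift of the first one.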

B. Deterministic Equivalent of Sum Rate for ORZF and ZF 

Figures 1 and 2 compare the approximated sum rate to
Monte-Carlo simulations for ORZF
and ZF, respectively. The error bars indicate one standard
deviation of the simulation results in each direction. For
uncorrelated antennas and equal path loss ($\Theta = I_M$, $L = I_K$),
all $\tau_k$ are identical. On the contrary, if $L \neq I_K$, we perform
a (suboptimal) water-filling algorithm on the per user SNR
$\rho_k = l_k\rho/M$ subject to a total of $B = 100K$ bits (corresponding
approximately to $\tau^2 = 0.1$, if $B$ is allocated equally among the
K users) for RVQ of the user channels. The r| obtained by 
the water- filling algorithm are referred to as r| . 
Fig. 4. RZF, ergodic sum rate vs. SNR, with $B$ bits per user to maintain a
sum rate offset of $K\log_2 b = 10$ and $\Theta = I_M$, $L = I_K$, $M = 10$, $\beta = 1$.

We observe that the expressions derived for large $(K, M)$ lie
approximately within two standard deviations of the simulation
results even for finite $(K, M)$. Therefore, the approximations
derived in Section IV are accurate and can be applied to
concrete optimization problems for the multi-user downlink
channel.

1) RZF under Limited Feedback: Figure 3 shows the ergodic
sum rate with error variance $\tau^2 = 0.1$ for various transmit
filters. We consider two RZF filters, ORZF using the sum rate
maximizing regularization term $\alpha^{\star\circ}$ in (67) and RZF-CDU
with $\alpha = 1/(\beta\rho)$, i.e. designed based on perfect CSIT. For
comparison we also plot the performance of the MMSE filter
with a = T^/^ + and the ZF filter. We observe that as 

soon as the error variance $\tau^2$ dominates over the noise power
$\sigma^2$, the ergodic sum rate of the RZF-CDU filter decreases and
approaches that of ZF precoding for high SNR. We further notice that
the ORZF and MMSE filters achieve similar performance. The
reason is that, for small values of $\tau^2$, both filters are almost
identical and the sum rate is not sensitive to small errors in
$\alpha$ in the filter design [31].

Figure 4 depicts the ergodic sum rate of RZF precoding
under RVQ with $B$ feedback bits per user. All values of $B$ are
rounded to the next higher integer. To avoid an infinitely high
regularization term $\alpha^{\star\circ}$, the minimum number of feedback bits
is set to one. We observe that the desired sum rate offset of
10 bits/s/Hz is approximately maintained over the given SNR
range when $B$ is chosen as in (111) under ORZF precoding.
Given an equal number of feedback bits, it can be seen that
the ORZF precoder achieves a significantly higher sum rate
compared to the RZF-CDU for medium and high SNR, e.g.
about 2.5 bits/s/Hz at 20 dB. Furthermore, to maintain a sum
rate offset of $M$ bits/s/Hz, [22] proposed a feedback scaling of
$B = \frac{M-1}{3}\rho_{\mathrm{dB}}$. From Figure 4 we observe that this scaling is
very pessimistic, since the sum rate offset is about 6 bits/s/Hz.
Thus, for RZF-CDU precoding the scaling of the number of
feedback bits proposed in (111) proves to be more accurate.

2) ZF: Optimal Number of Active Users: Figure 6 com- 
pares the optimal number of active users K*° in (124) to 
the optimal number of active users K* obtained from Monte- 
Carlo simulations-, whereas Figure 7 depicts the impact of a 
suboptimal number of active users on the ergodic sum rate of 
the system. 

From Figure 6 it can be observed that (i) the approximated
results $K^{\star\circ}$ fit well with the simulation results even
for small dimensions, (ii) $(K^{\star}, K^{\star\circ})$ increase with the SNR,
(iii) for $\tau^2 \neq 0$, $(K^{\star}, K^{\star\circ})$ saturate at high SNR and (iv)
introducing correlation and path loss leads to a larger dispersion
of $(K^{\star}, K^{\star\circ})$ over the selected SNR range.

From Figure 7 we observe that, (i) the deterministic equiv- 
alent K*° achieve most of the sum rate and (ii) adapting 

^Note that we do not perform any user scheduling i.e. we do not test all 
possible combinations of K CM since that would alter the effective channel 
distribution. 
Fig. 6. ZF, sum rate maximizing number of active users vs. SNR with
$\tau^2 = 0.1$.

Fig. 7. ZF, ergodic sum rate vs. SNR with $M = 16$ and $\tau^2 = 0.1$.

the number of users is beneficial compared to a fixed $K$.
Moreover, from Figure 6 we identify $K = 8$ as a good choice
(for $\tau^2 = 0.1$) and, as expected, the performance is optimal
in the medium SNR regime and suboptimal at low and high
SNR. The situation changes by adding correlation and path
loss. Since $K = 8$ is highly suboptimal for low and medium
SNR (cf. Figure 6) we observe a significant loss in sum rate in
this regime. Consequently, the number of active users must be
adapted to the channel conditions and the approximate result
$K^{\star\circ}$ is a good choice to determine the number of active users
in the system.

3) Optimal Amount of Channel Training in TDD Multi-user
Systems: Figure 8 depicts the optimal relative amount
of training $T_t^{\star\circ}/T$ for ZF and ORZF precoding. We observe
that $T_t^{\star\circ}/T$ decreases with (i) increasing SNR and (ii), for
fixed SNR, with increasing $T$. That is, for increasing SNR the
estimation becomes more accurate and resources for channel
training are reallocated to data transmission. Furthermore,
$T_t^{\star\circ}/T$ saturates at $K/T$ due to the orthogonality constraint
on the pilot sequences. Note that $T_t^{\star\circ}/T$ depends on the ratio
$\beta$ and only the saturation level depends on the specific number
of users $K$. The fact that increasing the channel coherence
block $T$ reduces the relative amount of training $T_t^{\star\circ}/T$ has
also been observed in [44] under a different model for $\tau^2$.

Fig. 8. Optimal amount of training for ZF and ORZF, $T_t^{\star\circ}/T$ vs. downlink
SNR with $\Theta = I_M$, $L = I_K$, $M = 32$, $K = 16$, $\rho_{ul} = 0.1\rho_{dl}$; ORZF is
indicated by circle marks.

Furthermore, we observe that the optimal amount of training
is less for ORZF than for ZF precoding. This is due to the
distortion-aware design of the ORZF. Moreover, the relative
amount of training $T_t^{\star\circ}/T$ of both ZF and ORZF converges, for
low and high SNR, to $1/2$ and $K/T$, respectively, as predicted
by the theoretical analysis.

Figure 9 compares the normalized ergodic sum rates of ZF 
^sum with optimal training T^°^ for a fixed number of users 
K = \Q and the optimal number of users K = K*° from 
(124). Since, for a fixed $M$, $T_t^{\star\circ}$ depends on $\beta$ and $K$, and
$\beta^{\star\circ}$ depends on $T_t$ (through $\tau^2$), we apply an iterative algorithm
to find a joint optimum $(T_t^{\star\circ}, \beta^{\star\circ})$. That is, we fix $T_t$ while
optimizing $\beta$ and vice versa until convergence, which normally
occurs after a few iterations. We observe a significant gain in
the high SNR regime when $K$ is optimized jointly with $T_t$.
This gain increases for increasing $T$, since small values of $T$
limit $T_t$ to $K$ (cf. Figure 8) and thus constrain the optimization.
On the other hand, for higher values of $T$ (e.g. $T = 1000$)
this saturation effect occurs at much higher SNR and $T_t$ is no
longer limited by $K$, which increases the gain relative to a fixed
$K$.

VIII. Conclusion 

In this paper we derived deterministic equivalents of the 
SINR of ZF and RZF precoding by applying recent results 
from large dimensional random matrix theory. These ap- 
proximations are valid for any SNR and shown to be very 
accurate even for finite dimensions. Therefore, they provide 
useful tools for many applications, such as the sum rate 
maximizing number of users in a cell or the optimal ratio 
Fig. 9. ZF, normalized sum rate $\bar R^{\circ}_{\mathrm{sum,zf}}$ vs. forward SNR for $T_t = T_t^{\star\circ}$,
with $M = 32$ and a fixed ratio $\rho_{ul}/\rho_{dl}$.



where 



l + TjyH(B[j] -zlAr)-iyj 



(l/iV)trRAr(B[j] - zl 



N) 



^ED 



1 



with B 



B 



N 



Tje{z) 
* and E 



(149) 



and 



jyjYj ana Jii = l^v wnen a is m 
E = RAr when a is e. The rest of the proof unfolds as in [34], 
where the authors consider Qjv = lAf- Since Qjv is uniformly 
bounded for all N it does not change the rate of convergence 
of wj'i and wlj in (147). Thus (144) holds true and TrD"! 
is a deterministic equivalent of tobjv,Qjv(^)- 

The proof of the existence of e{z) is equivalent to that given 
in [34]. Furthermore, the uniqueness of e{z) for > 

has also been proved in [34]. Since we evaluate ,Qn{^)° 
for z < we prove the uniqueness of e{z) for z < in the 
following subsection. 



between channel training and data transmission. In particular, 
we propose a RZF precoder which takes the information 
about transmit correlation, path loss and CSIT mismatch into 
account. Furthermore, we find that, given a target user rate, 
the improved precoder design can significantly decrease the 
amount of necessary feedback rate. 

Appendix I 
Proof of Theorem 1 

A. Proof of convergence 

The proof is an adaptation of the proof provided in 
[34]. We find a deterministic approximation m'^^ Qn^"^) 
mBjv.Q]v(-^) of the form TrD^^, such that 



B. Proof of Uniqueness of e{z) 

Let (c, c) and (e, e) be two solutions of equations (4) and 
(5), respectively. We prove the uniqueness of e{z) by showing 



R]v^'(cRAr + S 



that e — e = 0. Denote A — i^jy 



1/2 



N - 

B 



zl 



n) "^R-jv 



-at) ---n 

8, we have 



^tU^ andB: 



^1/2 
■N 



1/2^ 
+ 



(I„ + eTjv) 



1, 



'^t]I^. With Lemma 



c-c = -(e-e)-TrBB 
e-e = -(c-c)TrAA. 



Substituting (150) into (151) we obtain 

1 



1 - -^TrAATrBB^ 0. 



(150) 
(151) 

(152) 



TiQat (Bat - zIn) ^ ~ TrD ^ 0, (144) in order to show that e - e = 0, it is sufficient to show that 



almost surely. Let ~ diagl^i, . . • ,t„) and B^v ~ Sat + 
Ej^iT-jyjyj^ with Yj = {1/^)R]^^Xj. now we apply 
Lemma 8 to compute 



Qjv (Bat - zljv) '-D-i = 

[D - (Bjv - 2lAr)Qjv'] Qn (Bat - zIat)" 

We choose D = (Sjv — zljy — zpjvR-Jv) Q^r^ with 



T] = iTrAATrBB < 1. Applying the inequality |TrXY| < 
VTrXXHTi-YYH to r;, we obtain 



(153) 

iTrAA^TrBB^. It 



where r/i = ^TrAA^TrBB" and 772 ^ 



(145) remains to prove that 771, 772 < 1- We rewrite (5) and (4) as 



cTrAA^ 



PN 



n 

zBn ^ 1 



/371 ^ 1 + r,-e(z) ' 
.7=1 ■' 



(146) 



c = e-TrBB^ +7;2, 

P 



(154) 
(155) 



where v\ = TrR]^^ (cRat 



Similarly to [34], by taking the trace and dividing by (l/iV), 
equation (145) takes the form 



1 " 

TrD-i-77.B„.Q„.W = -E^J-^J 

j=i 

1 " 

and TrD'^Rw - e(z) = - ^ Tjd] 



w 



M 



(147) 
(148) 



Sat - zl^y^i^N - zlN){cIiN + 
Sat - zlAr)"^Rjv^'' and 7;2 = iTrTArlln + eTAr)-^ Notice 
that vi,V2> since z < and the trace of a Hermitian positive- 
definite matrix is always positive. Substituting (155) into (154) 
we obtain 

e(l-77i)>0. (156) 

For e > 0, equation (156) implies that 771 < 1. Similarly we 
have 7/2 < 1 and therefore 7; < 1 which completes the proof. 
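Appendix II
Proof of Proposition 1

The strategy is as follows: First we show the sure convergence of the sequence $\{e_k\}$ for $0 < e_0 < -1/z$ to $e(z)$ for all $z$ for which the bound (158) below is smaller than one. Then we apply Vitali's convergence theorem [45, p. 169] to extend the sure convergence to all $z < 0$ by proving that all $e_k$ are images of Stieltjes transforms at $z$.

The first step is similar to the procedure in Appendix I-B. Denote $\mathbf{A}_{k-1} = \mathbf{R}_N^{1/2}(c_{k-1}\mathbf{R}_N + \mathbf{S}_N - z\mathbf{I}_N)^{-1}\mathbf{R}_N^{1/2}$, $\mathbf{A}_k = \mathbf{R}_N^{1/2}(c_k\mathbf{R}_N + \mathbf{S}_N - z\mathbf{I}_N)^{-1}\mathbf{R}_N^{1/2}$, $\mathbf{B}_{k-1} = \mathbf{T}_N^{1/2}(\mathbf{I}_n + e_{k-1}\mathbf{T}_N)^{-1}\mathbf{T}_N^{1/2}$ and $\mathbf{B}_k = \mathbf{T}_N^{1/2}(\mathbf{I}_n + e_k\mathbf{T}_N)^{-1}\mathbf{T}_N^{1/2}$. From (8) and (9) we obtain

$$\frac{e_k - e_{k+1}}{e_{k-1} - e_k} = \frac{1}{N^2}\operatorname{Tr}\mathbf{A}_{k-1}\mathbf{A}_k\operatorname{Tr}\mathbf{B}_{k-1}\mathbf{B}_k = \eta, \tag{157}$$

which can be upper-bounded as

$$\eta \le \frac{n}{N}\,\frac{\|\mathbf{R}_N\|^2\|\mathbf{T}_N\|^2}{|z|^2}. \tag{158}$$

If $|z|$ is such that the right-hand side of (158) is smaller than one, then $\eta < 1$ and $\{e_k\}$ is a Cauchy sequence which converges surely.

Suppose $e_k$ is a Stieltjes transform at $z$. To prove that $e_{k+1}$ is a Stieltjes transform, we need to verify the following conditions [46, Proposition 2.2]: (i) $\Im[e_{k+1}(z)] > 0$, (ii) $\Im[z e_{k+1}(z)] > 0$ and (iii) $\lim_{y\to\infty}[iy\,e_{k+1}(iy)] < \infty$, where $y = \Im[z]$. We rewrite (8) as

$$e_{k+1} = \frac{1}{N^2}\operatorname{Tr}\mathbf{A}_k\mathbf{A}_k^{H}\operatorname{Tr}\mathbf{B}_k\mathbf{B}_k^{H}\,e_k + \frac{1}{N}\operatorname{Tr}\mathbf{A}_k\mathbf{A}_k^{H}\,v_1 + v_2, \tag{159}$$

where $v_1 = \frac{1}{N}\operatorname{Tr}\mathbf{T}_N(\mathbf{I}_n + e_k\mathbf{T}_N)^{-1}(\mathbf{I}_n + e_k^{*}\mathbf{T}_N)^{-1}$ and $v_2 = \frac{1}{N}\operatorname{Tr}\mathbf{R}_N(c_k\mathbf{R}_N + \mathbf{S}_N - z\mathbf{I}_N)^{-1}(c_k^{*}\mathbf{R}_N + \mathbf{S}_N - z^{*}\mathbf{I}_N)^{-1}(\mathbf{S}_N - z^{*}\mathbf{I}_N)$. It is easy to show that $e_{k+1}$ in (159) verifies all three conditions if $e_k(z)$ is itself a Stieltjes transform.

Since $\{e_k\}$ is a sequence of Stieltjes transforms, it is uniformly bounded and analytic on all compact sets of $\mathbb{C}\setminus\mathbb{R}^{+}$, in particular on all sets $[-x, -1/x]$, $x > 0$. Moreover, the set of $z < 0$ for which (158) guarantees $\eta < 1$ contains a countably infinite number of points. Therefore, we can apply Vitali's convergence theorem [45, p. 169], which proves that $\{e_k\}$ is surely converging to the unique solution $e(z)$ for all $z < 0$.

Appendix III
Proof of Lemma 1

Denote $\mathbf{V} = (\mathbf{A} + c_0\mathbf{x}\mathbf{x}^{H} + c_1\mathbf{y}\mathbf{y}^{H} + c_2\mathbf{x}\mathbf{y}^{H} + c_2\mathbf{y}\mathbf{x}^{H})^{-1}$. Now $\mathbf{x}^{H}\mathbf{U}\mathbf{V}\mathbf{x}$ can be resolved using Lemma 8:

$$\mathbf{x}^{H}\mathbf{U}\mathbf{V}\mathbf{x} - \mathbf{x}^{H}\mathbf{U}\mathbf{A}^{-1}\mathbf{x} = -\mathbf{x}^{H}\mathbf{U}\mathbf{V}(\mathbf{V}^{-1} - \mathbf{A})\mathbf{A}^{-1}\mathbf{x} = -\mathbf{x}^{H}\mathbf{U}\mathbf{V}(c_0\mathbf{x}\mathbf{x}^{H} + c_1\mathbf{y}\mathbf{y}^{H} + c_2\mathbf{x}\mathbf{y}^{H} + c_2\mathbf{y}\mathbf{x}^{H})\mathbf{A}^{-1}\mathbf{x}. \tag{160}$$

Equation (160) can be rewritten as

$$\mathbf{x}^{H}\mathbf{U}\mathbf{V}\mathbf{x} = \frac{\mathbf{x}^{H}\mathbf{U}\mathbf{A}^{-1}\mathbf{x} - \mathbf{x}^{H}\mathbf{U}\mathbf{V}\mathbf{y}\,(c_1\mathbf{y}^{H}\mathbf{A}^{-1}\mathbf{x} + c_2\mathbf{x}^{H}\mathbf{A}^{-1}\mathbf{x})}{1 + c_0\mathbf{x}^{H}\mathbf{A}^{-1}\mathbf{x} + c_2\mathbf{y}^{H}\mathbf{A}^{-1}\mathbf{x}}. \tag{161}$$

Similarly to (160), we apply Lemma 8 to $\mathbf{x}^{H}\mathbf{U}\mathbf{V}\mathbf{y}$. Thus, we obtain an expression involving the terms $\mathbf{x}^{H}\mathbf{U}\mathbf{A}^{-1}\mathbf{x}$, $\mathbf{y}^{H}\mathbf{A}^{-1}\mathbf{y}$, $\mathbf{x}^{H}\mathbf{U}\mathbf{A}^{-1}\mathbf{y}$ and $\mathbf{y}^{H}\mathbf{A}^{-1}\mathbf{x}$. To complete the proof we apply Corollary 7 and Corollary 6, with $u = \frac{1}{N}\operatorname{tr}\mathbf{A}^{-1}$ and $u' = \frac{1}{N}\operatorname{tr}\mathbf{U}\mathbf{A}^{-1}$, and we have

$$\mathbf{x}^{H}\mathbf{U}\mathbf{V}\mathbf{x} - \frac{u'(1 + c_1 u)}{(c_0c_1 - c_2^2)u^2 + (c_0 + c_1)u + 1} \xrightarrow{N\to\infty} 0, \tag{162}$$

almost surely. Similarly we have

$$\mathbf{x}^{H}\mathbf{U}\mathbf{V}\mathbf{y} - \frac{-c_2 u u'}{(c_0c_1 - c_2^2)u^2 + (c_0 + c_1)u + 1} \xrightarrow{N\to\infty} 0, \tag{163}$$

almost surely. Note that as $c_0, c_1, c_2 \in \mathbb{R}^{+}$ and $c_0c_1 \ge c_2^2$, equations (162) and (163) hold since then $(c_0c_1 - c_2^2)u^2 + (c_0 + c_1)u + 1$ is bounded away from zero.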

Appendix IV 
Important Lemmas 

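Lemma 4: [47, Lemma 2.2] Let $\mathbf{U}$ be an $N \times N$ invertible matrix and $\mathbf{x} \in \mathbb{C}^{N}$, $c \in \mathbb{C}$ for which $\mathbf{U} + c\mathbf{x}\mathbf{x}^{H}$ is invertible. Then

$$\mathbf{x}^{H}(\mathbf{U} + c\mathbf{x}\mathbf{x}^{H})^{-1} = \frac{\mathbf{x}^{H}\mathbf{U}^{-1}}{1 + c\,\mathbf{x}^{H}\mathbf{U}^{-1}\mathbf{x}}. \tag{164}$$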
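Identity (164) is easy to confirm on a small example. The following sketch, which is only a sanity check and not part of the original proof, evaluates both sides of Lemma 4 for an arbitrary $2 \times 2$ complex matrix. The helper names `inv2` and `row_times_matrix` and all numerical values are hypothetical.

```python
def inv2(m):
    """Inverse of a 2x2 (complex) matrix given as nested lists."""
    (a, b), (c, d) = m
    det = a * d - b * c
    return [[d / det, -b / det], [-c / det, a / det]]


def row_times_matrix(row, m):
    """Product of a length-2 row vector with a 2x2 matrix."""
    return [sum(row[i] * m[i][j] for i in range(2)) for j in range(2)]


# Arbitrary test data (assumptions for illustration only).
U = [[2.0 + 0.5j, 0.3], [0.1j, 1.5]]
x = [0.7 - 0.2j, -0.4 + 0.9j]
c = 0.8 + 0.3j
xh = [v.conjugate() for v in x]  # x^H as a row vector

# Left-hand side of (164): x^H (U + c x x^H)^{-1}
perturbed = [[U[i][j] + c * x[i] * xh[j] for j in range(2)] for i in range(2)]
lhs = row_times_matrix(xh, inv2(perturbed))

# Right-hand side of (164): x^H U^{-1} / (1 + c x^H U^{-1} x)
xh_uinv = row_times_matrix(xh, inv2(U))
quad = sum(xh_uinv[j] * x[j] for j in range(2))
rhs = [v / (1.0 + c * quad) for v in xh_uinv]

print("max deviation:", max(abs(l - r) for l, r in zip(lhs, rhs)))
```

The deviation should be at the level of floating-point rounding, reflecting that (164) is an exact algebraic identity.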

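Lemma 5: [48, Lemma 2.7] Let $\mathbf{U} \in \mathbb{C}^{N \times N}$ be a complex matrix with uniformly bounded spectral norm, with respect to $N$. Let $\mathbf{x} \in \mathbb{C}^{N}$ have i.i.d. complex entries of zero mean, variance $1/N$ and finite 8th order moment. Then,

$$\mathrm{E}\left[\left|\mathbf{x}^{H}\mathbf{U}\mathbf{x} - \frac{1}{N}\operatorname{tr}\mathbf{U}\right|^{4}\right] \le \frac{c}{N^{2}}, \tag{165}$$

where $c$ is a constant independent of $N$ and $\mathbf{U}$.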
Corollary 7: In the conditions of Lemma 5,

$$\mathbf{x}^{H}\mathbf{U}\mathbf{x} - \frac{1}{N}\operatorname{tr}\mathbf{U} \xrightarrow{N\to\infty} 0, \tag{166}$$

almost surely.
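The concentration stated in Corollary 7 can be observed in a small Monte Carlo experiment. The sketch below is a simplified illustration, not the general statement: it assumes a diagonal $\mathbf{U}$ with entries in $[0, 1]$ (so the spectral norm is bounded by one) and complex Gaussian entries for $\mathbf{x}$, for which all moments are finite. The function name `trace_lemma_gap` is hypothetical.

```python
import random


def trace_lemma_gap(N, rng):
    """Return |x^H U x - (1/N) tr U| for one random draw.

    U is diagonal with entries uniform on [0, 1]; x has i.i.d. complex
    Gaussian entries with zero mean and variance 1/N.
    """
    u = [rng.random() for _ in range(N)]
    sigma = (0.5 / N) ** 0.5  # std of real and imaginary parts
    quad = 0.0
    for ui in u:
        re = rng.gauss(0.0, sigma)
        im = rng.gauss(0.0, sigma)
        quad += ui * (re * re + im * im)  # u_i * |x_i|^2
    return abs(quad - sum(u) / N)


rng = random.Random(0)
for N in (10, 100, 1000, 10000):
    print(N, trace_lemma_gap(N, rng))
```

For this particular choice of $\mathbf{U}$ and $\mathbf{x}$, the standard deviation of the gap is of order $1/\sqrt{N}$, so the printed values should shrink as $N$ grows, although any single draw fluctuates.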

Lemma 6: Let $\mathbf{U}$ be as in Lemma 5 and $\mathbf{x}, \mathbf{y} \in \mathbb{C}^{N}$ be mutually independent with standard i.i.d. entries of zero mean, variance $1/N$ and uniformly bounded 4th order moment. Then,

$$\mathbf{y}^{H}\mathbf{U}\mathbf{x} \xrightarrow{N\to\infty} 0, \tag{167}$$

almost surely.

Proof: Remark that $\mathrm{E}|\mathbf{y}^{H}\mathbf{U}\mathbf{x}|^{4} < c/N^{2}$ for some constant $c > 0$ independent of $N$. The result then unfolds from the Markov inequality and the Borel-Cantelli Lemma [49]. ■
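Lemma 7: [50] Let $\zeta > 0$, $\mathbf{X} \in \mathbb{C}^{N \times N}$ Hermitian nonnegative definite, $\tau \in \mathbb{R}$ and $\mathbf{z} \in \mathbb{C}^{N}$. Then

$$\left|\operatorname{tr}\left((\mathbf{X} + \zeta\mathbf{I}_N)^{-1} - (\mathbf{X} + \tau\mathbf{z}\mathbf{z}^{H} + \zeta\mathbf{I}_N)^{-1}\right)\right| \le \frac{1}{\zeta}.$$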

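Lemma 8: Let $\mathbf{U}$ and $\mathbf{V}$ be two invertible complex matrices of size $N \times N$. Then

$$\mathbf{U}^{-1} - \mathbf{V}^{-1} = -\mathbf{U}^{-1}(\mathbf{U} - \mathbf{V})\mathbf{V}^{-1}. \tag{168}$$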
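Since Lemma 8 is invoked repeatedly in the appendices, a quick numerical check may help. The sketch below assumes the identity is the standard resolvent identity $\mathbf{U}^{-1} - \mathbf{V}^{-1} = -\mathbf{U}^{-1}(\mathbf{U} - \mathbf{V})\mathbf{V}^{-1}$ and verifies it for two arbitrary $2 \times 2$ complex matrices. The helper names and the matrices are hypothetical.

```python
def inv2(m):
    """Inverse of a 2x2 (complex) matrix given as nested lists."""
    (a, b), (c, d) = m
    det = a * d - b * c
    return [[d / det, -b / det], [-c / det, a / det]]


def matmul2(a, b):
    """Product of two 2x2 matrices."""
    return [[sum(a[i][k] * b[k][j] for k in range(2)) for j in range(2)]
            for i in range(2)]


def sub2(a, b):
    """Entrywise difference of two 2x2 matrices."""
    return [[a[i][j] - b[i][j] for j in range(2)] for i in range(2)]


# Arbitrary invertible test matrices (assumptions for illustration only).
U = [[2.0, 1.0j], [0.5, 3.0]]
V = [[1.0, -0.2], [0.4j, 2.5]]

lhs = sub2(inv2(U), inv2(V))                            # U^{-1} - V^{-1}
prod = matmul2(matmul2(inv2(U), sub2(U, V)), inv2(V))   # U^{-1} (U - V) V^{-1}
rhs = [[-v for v in row] for row in prod]

deviation = max(abs(lhs[i][j] - rhs[i][j]) for i in range(2) for j in range(2))
print("max deviation:", deviation)
```

The identity follows from expanding $\mathbf{U}^{-1}(\mathbf{U} - \mathbf{V})\mathbf{V}^{-1} = \mathbf{V}^{-1} - \mathbf{U}^{-1}$, so the deviation should be at rounding level.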
Appendix V
Proof of Lemma 2

The proof unfolds from a direct application of Tonelli's Theorem [49]. Denoting $(X, \mathcal{X}, P_X)$ the probability space that generates the series $\mathbf{x}_1, \mathbf{x}_2, \ldots$, we have that for every $\omega \in A$ (i.e. for every realization $\mathbf{A}_1(\omega), \mathbf{A}_2(\omega), \ldots$ with $\omega \in A$), Corollary 7 holds. From Tonelli's Theorem, the space $B$ of couples $(x, \omega) \in X \times \Omega$ for which Corollary 7 holds satisfies

$$\int_{X\times\Omega} 1_B(x,\omega)\,dP_{X\times\Omega}(x,\omega) = \int_{\Omega}\int_{X} 1_B(x,\omega)\,dP_X(x)\,dP(\omega).$$

If $\omega \in A$, $1_B(x,\omega) = 1$ on a subset of $X$ of probability one. Therefore, the inner integral equals one. Since $P(A) = 1$ the outer integral also equals one, and the result is proved.

Appendix VI 
Proof of Lemma 3 

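The proof unfolds similarly as for Lemma 2. For $\omega \in B$, the smallest eigenvalue of $\mathbf{B}_N(\omega)$ is uniformly greater than $\varepsilon = \varepsilon(\omega)$. Therefore $\mathbf{B}_N(\omega)$ and $\mathbf{B}_N(\omega) + \mathbf{v}\mathbf{v}^{H}$ are invertible. Taking $z = -\varepsilon(\omega)/2$, we write

$$\frac{1}{N}\operatorname{tr}\mathbf{A}_N\mathbf{B}_N^{-1}(\omega) = \frac{1}{N}\operatorname{tr}\mathbf{A}_N\left(\left[\mathbf{B}_N(\omega) - \frac{\varepsilon}{2}\mathbf{I}_N\right] - z\mathbf{I}_N\right)^{-1}$$

and

$$\frac{1}{N}\operatorname{tr}\mathbf{A}_N\left(\mathbf{B}_N(\omega) + \mathbf{v}\mathbf{v}^{H}\right)^{-1} = \frac{1}{N}\operatorname{tr}\mathbf{A}_N\left(\left[\mathbf{B}_N(\omega) + \mathbf{v}\mathbf{v}^{H} - \frac{\varepsilon}{2}\mathbf{I}_N\right] - z\mathbf{I}_N\right)^{-1}.$$

Under these notations, $\mathbf{B}_N(\omega) - \frac{\varepsilon}{2}\mathbf{I}_N$ and $\mathbf{B}_N(\omega) + \mathbf{v}\mathbf{v}^{H} - \frac{\varepsilon}{2}\mathbf{I}_N$ are still nonnegative definite for all $N$. Therefore, the rank-1 perturbation Lemma 7 can be applied for this $\omega$. Then, from Tonelli's Theorem [49], in the space that generates the couples $((\mathbf{x}_1, \mathbf{x}_2, \ldots), (\mathbf{B}_1, \mathbf{B}_2, \ldots))$, the subspace where the rank-1 perturbation lemma applies has probability one, which completes the proof.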

Appendix VII 
Details on Limiting Cases in Throughput Analysis 

A. Optimal Regularized Zero-forcing Precoding 

For $\beta = 1$ observe that $g(1, \rho)$ scales as $\sqrt{4\rho}$. Thus, for $\rho \to \infty$, (107) converges to $b^2 - 1$.

If $\beta > 1$, the term $g(1, \rho)$ takes the form

$$g(1, \rho) = (\beta - 1)\rho + |1 - \beta|\rho\,(1 + o(1)) \sim 2\rho(\beta - 1). \tag{169}$$

Therefore, for $\rho \to \infty$, (107) converges to $b - 1$.

B. Regularized Zero-forcing Precoding with $\alpha = 1/(\beta\rho)$
With Theorem 2 for a = l/(/3p) calculate the rate gap per 

user log2( ^t+7'rlf'"(r)°^ ) fo"" '^'^ = 2^5^, equate it to logs h 
and solve for B^'^^. We obtain (112) with 



'/'r°zf(P:&) = P 



- 1 - 7*)[(1 + m°)2i + 1] + {m°fb 



*(1 - 6 + 7*)[(1 + m°)2 - 1] + (m°)26 ' 

(170) 

where7*=7^,..f(T2=0). 



For /? = 1, the terms c, m°, and 7* scale as \/^/p, 
Y^p, Y^/2 and y^, respectively. Therefore, (j)°^f{p,b) ^-^^ 
2(6-1). 

If 13 >1, for large p, the term c can be expanded as 

p/3(/3-l) + 4p2/3(i_/3)- ^1^1^ 

The term m° scales as and and 7* scale as p{j3—l). 

Note that the SINR 7* converges to the SINR of ZF precoding 
(87), for = and p — ^ 00. With this approximation we obtain 

Appendix VIII 
Proof of Proposition 4 

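For $\mathbf{\Theta} = \mathbf{I}_M$ and $\mathbf{L} = \mathbf{I}_K$, from Corollary 2, equation (81) takes the form

$$\gamma^{\circ}_{k,\mathrm{zf}} = \frac{1 - \tau^2}{\tau^2 + \frac{1}{\rho}}\,(\beta - 1). \tag{172}$$

For equation (123) we obtain

$$\frac{a\beta}{1 + a(\beta - 1)} = \ln\left(1 + a(\beta - 1)\right), \tag{173}$$

where $a = \frac{1 - \tau^2}{\tau^2 + 1/\rho}$. Denoting

$$w(\beta) = \frac{a - 1}{a(\beta - 1) + 1} \quad\text{and}\quad x = \frac{a - 1}{e}, \tag{174}$$

we can rewrite (173) as

$$w(\beta)\,e^{w(\beta)} = x. \tag{175}$$

Notice that $w(\beta) = \mathcal{W}(x)$, where $\mathcal{W}(x)$ is the Lambert W-function defined for every $z \in \mathbb{C}$ as $z = \mathcal{W}(z)e^{\mathcal{W}(z)}$. Therefore, by solving $w(\beta) = \mathcal{W}(x)$ we have

$$\beta^{\star} = \left(1 - \frac{1}{a}\right)\left(1 + \frac{1}{\mathcal{W}(x)}\right). \tag{176}$$

For $\tau \in [0, 1]$, $\beta > 1$ we have $w \ge -1$ and $x \in [-e^{-1}, \infty)$. In this case the relation $z = \mathcal{W}(z)e^{\mathcal{W}(z)}$ is single-valued and $\mathcal{W}(x)$ is well defined.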
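The closed form in terms of the Lambert W-function can be evaluated and cross-checked against the stationarity condition. The sketch below makes several assumptions that should be read as such: it takes $a = (1 - \tau^2)/(\tau^2 + 1/\rho)$, $x = (a - 1)/e$, the solution $\beta^{\star} = (1 - 1/a)(1 + 1/\mathcal{W}(x))$, and the condition $a\beta/(1 + a(\beta - 1)) = \ln(1 + a(\beta - 1))$ with a natural logarithm. The principal branch is computed with a plain Newton iteration rather than a library routine, and the function names are hypothetical.

```python
import math


def lambert_w0(x, tol=1e-14, max_iter=100):
    """Principal branch W(x) for x > -1/e via Newton's method on w*exp(w) = x."""
    if x <= -1.0 / math.e:
        raise ValueError("this sketch requires x > -1/e")
    w = math.log1p(x)  # starting point to the right of the root
    for _ in range(max_iter):
        ew = math.exp(w)
        step = (w * ew - x) / (ew * (w + 1.0))
        w -= step
        if abs(step) < tol:
            break
    return w


def optimal_beta(tau2, rho):
    """Evaluate the assumed closed form for beta*; requires a > 0 and a != 1."""
    a = (1.0 - tau2) / (tau2 + 1.0 / rho)
    x = (a - 1.0) / math.e
    return (1.0 - 1.0 / a) * (1.0 + 1.0 / lambert_w0(x))


# Hypothetical operating point: tau^2 = 0.1, rho = 10 (10 dB).
tau2, rho = 0.1, 10.0
a = (1.0 - tau2) / (tau2 + 1.0 / rho)
beta = optimal_beta(tau2, rho)
residual = a * beta / (1.0 + a * (beta - 1.0)) - math.log(1.0 + a * (beta - 1.0))
print("beta* =", beta)
print("residual of the stationarity condition:", residual)
```

A residual close to zero indicates that the Lambert W expression and the stationarity condition agree at this operating point. The case $a = 1$ (where $x = 0$ and $\mathcal{W}(x) = 0$) is excluded here because the closed form then has to be read as a limit.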

Appendix IX 

Details on Limiting Cases for Optimal Training 

A. Low SNR Regime 

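For ZF, applying a Taylor expansion around $\rho_{dl} = 0$, equation (131) can be written as

$$R_{\mathrm{sum}}^{\mathrm{zf}} = K\left(1 - \frac{T_{t,\mathrm{zf}}}{T}\right)\frac{1}{\log 2}\,T_{t,\mathrm{zf}}\,\rho_{dl}\rho_{ul}(\beta - 1) + o(1). \tag{177}$$

The terms $R_{\mathrm{sum}}^{\mathrm{zf}}$ (131) and $\left(1 - \frac{T_{t,\mathrm{zf}}}{T}\right)T_{t,\mathrm{zf}}\,\rho_{dl}\rho_{ul}(\beta - 1)$ are both strictly concave in $T_{t,\mathrm{zf}}$ and $o(1)$ is understood to converge uniformly to zero w.r.t. $T_{t,\mathrm{zf}} \in [K, T]$. Therefore,

$$\arg\max R_{\mathrm{sum}}^{\mathrm{zf}} - \arg\max\left(1 - \frac{T_{t,\mathrm{zf}}}{T}\right)T_{t,\mathrm{zf}} \longrightarrow 0. \tag{178}$$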
The maximum of $\left(1 - T_{t,\mathrm{zf}}/T\right)T_{t,\mathrm{zf}}\,\rho_{dl}\rho_{ul}(\beta - 1)$ is attained at $T_{t,\mathrm{zf}} = T/2$.
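The low-SNR optimum can be illustrated with a direct search. The sketch below maximizes only the factor $(1 - T_t/T)\,T_t$, since the remaining factors do not depend on the training length; it assumes integer training lengths in $[K, T]$, and the function name and the values of $T$ and $K$ are hypothetical.

```python
def best_training_length(T, K):
    """Maximise (1 - Tt/T) * Tt over integer training lengths Tt in [K, T]."""
    return max(range(K, T + 1), key=lambda tt: (1.0 - tt / T) * tt)


# Hypothetical coherence interval and number of users.
print(best_training_length(T=1000, K=16))   # unconstrained optimum T/2
print(best_training_length(T=1000, K=600))  # constraint Tt >= K is active
```

With $K \le T/2$ the search returns $T/2$, in agreement with the text. When $K > T/2$ the objective is decreasing on $[K, T]$ and the search returns $K$, a boundary case that the asymptotic statement above does not need to consider.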

For RZF precoding, write (134) as w~cTt.r7,fPdi + o{l) and 
(133) as d=l + crt,rzf(l + + where c = pui/pdi- 
Then, (132) can be written as 

Rsu^-[^-^) log 2 ^^^^^ 

With the same arguments as for ZF precoding, maximizing 

(1 - Tt,,,f/T) Tt,,iPpl yields T*°^,^T/2. 

B. High SNR Regime 

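For ZF and for large $\rho_{dl}$, equation (131) can be written as

$$R_{\mathrm{sum}}^{\mathrm{zf}} = K\left(1 - \frac{T_{t,\mathrm{zf}}}{T}\right)\log_2\rho_{dl}\,[1 + o(1)]. \tag{180}$$

If $T_{t,\mathrm{zf}} > K$, then the ratio

$$\frac{\left(1 - \frac{T_{t,\mathrm{zf}}}{T}\right)\log_2\rho_{dl}\,[1 + o(1)]}{\left(1 - \frac{K}{T}\right)\log_2\rho_{dl}\,[1 + o(1)]} = \frac{\left(1 - \frac{T_{t,\mathrm{zf}}}{T}\right)[1 + o(1)]}{\left(1 - \frac{K}{T}\right)[1 + o(1)]} \tag{181}$$

is asymptotically smaller than one. Therefore, $T_{t,\mathrm{zf}}^{\star\circ} = K$.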

Likewise, for RZF, by writing w = cTt^rzf/(l+crt i.zf)+o(l) 
and rf=(/3- l)pdi i+cTrL +''(!)' where c = pui/pdi, (132) 
takes the form 

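$$R_{\mathrm{sum}}^{\mathrm{rzf}} = K\left(1 - \frac{T_{t,\mathrm{rzf}}}{T}\right)\log_2\rho_{dl} + o(1). \tag{182}$$

With the same arguments as for ZF, $T_{t,\mathrm{rzf}}^{\star\circ} = K$.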